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1 Introduction 



Karl Pearson (1895), in the context of fitting curves to real data, introduced his famous 
family of frequency curves by means of the differential equation 

f'(x)_pi(x) 

f(x) P2(x) ' 

where / is the probability density and pi is a polynomial in x of degree at most i, i = 1,2. 
Since then, a vast bibliography has been developed regarding the properties of Pearson 
distributions. The original classification given by Pearson contains twelve types (I-XII), 
although this numbering system does not have a clear systematic basis; Johnson et al. 
(1994), p. 16. Craig (1936) proposed a new exposition and chart for Pearson curves; 
however, a more reasonable and convenient classification is included in a review paper by 
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Diaconis and Zabell (1991). Extensions to discrete distributions have been introduced by 
Ord (1967) and an extensive review can be found in Ord (1972), Chapter 1. 

In this paper we present and review a number of properties satisfied by the distributions 
of the Pearson family and the associated Rodrigues polynomials, the polynomials that are 
produced by a Rodrigues-type formula. Our main focus is on a suitable subset of Pearson 
distributions, the Integrated Pearson Family, because this class subsumes all interesting 
properties related to the associated orthogonal polynomial systems. For example, it will 
be shown in Section 4 that orthogonality of Rodrigues polynomials with respect to an 
ordinary Pearson density / results to an equivalent definition of the integrated Pearson 
system. This consideration entails an alternative classification of (integrated) Pearson 
distributions, which is essentially the one given in Diaconis and Zabell (1991). 

In the context of deriving variance bounds for functions of random variables, Afendras 
et al. (2007, 2011) and Afendras and Papadatos (2011) have made use of the following 
definition, which provides the main framework of the present article. 

Definition 1.1 (Integrated Pearson Family). Let X be an absolutely continuous random 
variable with density / and finite mean /! = EX. We say that X (or its density) belongs 
to the integrated Pearson family (or integrated Pearson system) if there exists a quadratic 
polynomial q(x) = 8x 2 + fix + y (with 8, j8 , y 6 R, | 8\ + |j8 1 + | y| > 0) such that 



This fact will be denoted by X ~ IP(/i;g) or / ~ IP(/i;g) or, more explicitly, X or / ~ 



Despite the fact that the integrated Pearson family is quite restricted, compared to the 
usual Pearson system - see Proposition 2.1(iii), below - we believe that the reader will 
find here some interesting observations that are worth to be highlighted. The integrated 
Pearson system satisfies many interesting properties, like recurrences on moments and on 
Rodrigues polynomials, covariance identities, closeness of each type under particularly 
useful transformations etc.; such properties are by far more complicated (if they are, at 
all, true) for distributions outside the Integrated Pearson system. These features should be 
combined with the fact that the Rodrigues polynomials form an orthogonal system for the 
corresponding Pearson density if and only if the density belongs to the Integrated Pearson 
family. In other words, the Rodrigues polynomials and, consequently, the ordinary Pear- 
son densities, are useful only if they are considered in the framework of the Integrated 
Pearson system. To our knowledge, these facts have not been written explicitly elsewhere. 

The paper is organized as follows: In Section 2 we provide a detailed classification of 
the integrated Pearson family. It turns out that, up to an affine transformation, there are six 
different types of densities, included in Table 2.1. We also provide conditions guarantee- 
ing the existence of moments, and we give recurrences as long as these moments exist. In 
Section 3, a detailed comparison between the integrated Pearson family and the ordinary 
Pearson system is presented. Interestingly enough, there exist a simple algorithm that en- 
ables one to decide whether a given ordinary Pearson density belongs to the integrated 




(1.1) 



IP(/i;5,j3,y). 



Integrated Pearson family and Rodrigues polynomials 



3 



system, or not. In Section 4, exploiting a result of Diaconis and Zabell (1991), we show 
that (under natural moment conditions) the first three Rodrigues polynomials (of degree 
0, 1 and 2) are orthogonal with respect to an ordinary Pearson density if and only if this 
density belongs to the integrated Pearson system. Finally, in Section 5 we provide recur- 
rences between the orthonormal polynomials and their derivatives; in fact, the derivatives 
themselves are orthogonal polynomials with respect to other integrating Pearson densi- 
ties, having the same quadratic polynomial, up to a scalar multiple. Although we do not 
include any specific applications of these results here, we notice that such recurrences are 
particularly useful in obtaining Fourier expansions of the derivatives of a function of a 
Pearson variate. The main result of Section 5 is given by Corollary 5.4. It provides an 
explicit relation (in terms of /i and q) between the m-th derivative of an orthonormal poly- 
nomial of degree k^ m and the corresponding orthonormal polynomial of degree k — m. 
That is, it relates the orthonormal polynomial system, associated with some / ~ IP(/i;g), 
to the corresponding orthonormal polynomial system associated with the 'target' density 
f m ~q m f. 

In the sequel and elsewhere in this article, X ~ IP(/i; <5,/3, 7) means that X has finite 
mean /1, and that X admits a density / (w.r.t. Lebesgue measure on R) such that (1.1) is 
fulfilled. Define the open (bounded or unbounded) interval 

J = J(X) := (essinf(X),esssup(X)). (1.2) 

If F is the distribution function of X then J = ((Xf,0)f) = (a, ft)), say, where OCf '■— inf{x : 
F(x) > 0}, (Of := supjx : F(x) < 1}. It is clear that (1.1) takes the form = whenever 
x = p is a zero of q that lies outside the interval (a, ft)); thus, f(p) may assume any value 
in this case. However, in order to be specific, we can redefine f(p) = at such points p, 
if any, without any loss of generality. Therefore, we shall use this convention through the 
whole article without any further reference to it. 

2 A complete classification of the Integrated Pearson family 

We show in this section that the Integrated Pearson family contains six different types 
of distributions. These are classified in terms of the corresponding quadratic polynomial 
q(x) = 8x 2 + fix + 7 and its discriminant A = /3 2 — 48y as follows: Type 1 (Normal- 
type, 8 = /3 = 0); type 2 (Gamma- type, 8 = 0, /3 7^ 0); type 3 (Beta- type, 8 < 0); type 
4 (Student-type, 8 > 0, A < 0); type 5 (Reciprocal Gamma- type, 8 > 0, A = 0); type 6 
(Snedecor-type, 8 > 0, A > 0). The first three types (with 5^0) consist of the well-known 
Normal, Gamma and Beta random variables and their linear transformations; the last three 
types (with 8 > 0) consist of some less familiar distributions (see Table 2.1, below); they 
have finite moments up to order 1 + i — £ (for any £ > 0) while E|X| 1+1//(5 = °°. The 
proposed classification is very similar to the one given by Diaconis and Zabell (1991), 
Table 2 and pp. 294-296. 

We start with an easily verified proposition. 

Proposition 2.1. LetX ~IP( i u;g) and set/ = (a, ft)) = (essinf(X), esssup(X)). Then, 
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(i) f(x) is strictly positive for x in 7 and zero otherwise, i.e., {x : f(x) > 0} = 7; 

(ii) / G C°°(7), that is, / has derivatives of any order in 7; 

(iii) X is a (usual) Pearson random variable supported in 7; 

(iv) q(x) = 8x 2 + fix + 7 > for all x G 7; 

(v) if a > — oo then q(a) = and, similarly, if ft) < +°° then q(a>) = 0; 

(vi) for any 0,c G R with 0^0, the random variable X := 6X + c ~ IP(/i;^) with 
jl = 6/l+c and <f(x) = 2 <?((x-c)/0). 

Proof. By (1.1), x q(x)f(x) is continuous. On the other hand, from the definition 
of 7 = (05^,0)/-) = (oc,(o) it follows that q(x)f(x) must vanish for all x ^ a (if any) 
and for all x ^ ft) (if any). Also, it must be strictly positive for x G 7. Indeed, if x G 
(jU,G)) then q{x)f{x) = f™(t - n)f(t)dt ^ (x- -F(x)) > 0; if x G (a,ju) then 
<?(x)/(x) = /fj/i -0/(0* ^ (J" -0^(0 > 0; Anally, <?(/0/(/0 = ^E|X-ju| > 0. 
Thus, q(x)f(x) > for all x G (a, ft)). Since q is continuous and has no roots in 7 it fol- 
lows that both q(x) and f(x) are strictly positive (and continuous) in 7. The vanishing of 
qf outside 7 shows that f(x) = for all x ^ 7, with the possible exception at the points 
x ^ 7 which are real roots of q. Clearly, if p G R \ (a, ft)) is a zero of q we can redefine 
/(p) = 0, if necessary, so that (i) and (iv) follow. On the other hand, / : (a, ft)) — » (0,°°) 
is C°°(7). Indeed, writing p\(x) = /! — x — q\x) (a polynomial of degree at most one) we 
see from (1.1) that / : 7— > (0,°°) is continuous and thus, 

/'« = /W^tt or ' equivalent^, ^ = * L ~ X ~* {X \ x G 7. (2.1) 
q[x) f{x) q{x) 

This proves (iii). Moreover, (2.1) shows that /' is continuous in 7 and, inductively, that 
y(n+l) : y _i. |^ j s continuous, since for x G 7, 

Now (vi) is straightforward and it remains to show (v). To this end, assume that ft) < oo. 
Since q{co) = \im x ^ (0 q(x) and q(x) > for x in a left neighborhood of ft), it follows that 
q(co) ^ 0. Assume now that q{(6) > and define 

Ai := inf {g(x)}>0, A2 := sup \}i—x — q'{x)\<°°. 



Then, for all x G [jU, ft)) 



L 



*H-t-q>(t) dt 



H q{t) 



: x \LL-t-q'(t)\ , /"«» \u-t-q'(t)\ , , N A 2 
9(0 ^ 9(0 M 
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Setting A := (o) — ju)^ < °° and observing that 



ln/(x) = ln/(/i) + J* f -^±dt = ln/Gu) + 



•* jl-t-qf{t) 
■ " q(t) 



dt, x e [/i, to), 



we have 



ln/(x)|^|ln/(/i)|+A:=c< 



oo 



Therefore, there exist constants c\ , C2 such that < c\ ^ /(jc) ^ C2 < 00 for all x G [jU, co). 



which contradicts the assumption q(co) > 0. The case a > —°° is reduced to the case 
CO < 00 if we consider the random variable X = —X with mean Ji = — /1 and support 
J(X) = (a, co) = (-co, —a). According to (vi), its density / satisfies (1.1) with quadratic 
polynomial q(x) = q(—x). Thus, if a > — °° then co < °° and q(a) = q(—cc) = q(co) = 



Corollary 2.1. Let X ~ IP(/i;g) and assume that a = essinf(X) and co = esssup(X) 
are the lower and upper endpoints of the distribution function of X. Then, the support 
of X (or of its density /) S(f) = S(X) := {x : f(x) > 0}, equals to the open interval 
J = J(X) = (a, co). This interval support has the following two properties: 

(i) J Q S+(q) := {jc : q(x) > 0} and 

(ii) 7 is a maximal open interval contained in S + (q), i.e., for any open interval J C S + (q) 
it is true that either JCJorJ(~)J = 0. 

In other words, the support J of X can be taken to be an open interval that coincides 
to some connected component of the open set {x : q(x) > 0}. Since q is a polynomial 
of degree at most two, it is clear that the set {x : q(x) > 0} has at most two connected 
components. For example, if q(x) = x 2 then either J = (— °°,0) or J = (0,°°); if q(x) = 
x 2 — l then either 7= (— °°, — 1) or 7= (I, 00 ); if q(x) = 1— x 2 then7= (— 1,1); if q{x) =x 
then 7 = (0,°°); if q(x) = 1 +x 2 or q(x) = 1 then 7 = R. Since, however, EX = /1 e 7, 
any particular choice of ji e {x : q(x) > 0} characterizes the support 7 of X. We say that 
q(x) = 8x 2 + fix + 7 is admissible if there exists /1 G R such that /1 G {x : q(x) > 0}; 
thus, {x : q(x) > 0} ^ whenever q is admissible. In the sequel we shall show that 
for any admissible choice of q and for any /1 G {x : q(x) > 0} there exists an absolutely 
continuous random variable X with density / such that EX = /! and (1.1) is fulfilled. 
Moreover, it will become clear that / is characterized by the pair (n;q). Therefore, the 
notation X ~ IP(/i;g) or, equivalently, / ~ IP(/i;g), has a well-defined meaning. 

The proposed classification distinguishes between the cases 5 = 0, 8 < and 8 > 0, 
as follows: 

2.1 The case 5 = 



Thus, 




CO 



0. 



□ 



We have to further distinguish between the cases /3 = and /3 ^ 0. 
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2.1.1 The subcase 8 = 0, /3 = 

Since q(x) = y and q is admissible we must have 7 > 0. Therefore, J(X) — R. Fixing 
/I G R and solving the differential equation (2.1) we get 

, \ 1 (-'-M) 2 

f(x)= vm e 27 ' xeR ' 

i.e. X ~ N(n,o 2 ) with a 2 = 7. 

2.1.2 The subcase 8 = 0, /3 7^ 

Assume that g(jc) = /3x + 7 with /3 7^ and fix a number /1 G {jc : <?(jc) > 0}; that is, 
q{fi) = /3/i + 7> 0. According to Proposition 2.1(vi) we may further assume that /3 > 0, 
7=0 and /i > 0; otherwise, it suffices to consider the random variable X = JL (X + ^ ) with 

q(x) = |/3 \x and EX = jl = 4r (/i + ^) = > since q(fj.) > 0. Now, since q(x) = fix 
with /3 > we must have 7(X) = (0, °°) . Fixing /1 > and solving the differential equation 
(2.1) we get 

(1/^)M/^ /j8 _ _ Vjg 
/W r(M/j8) ' 

That is, X ~ r(a, A) with a = /i//3 > and A = 1//3 > 0. Hence, a linear non-constant ^ 
corresponds to a linear transformation, X = 0X + c, 7^ 0, of a Gamma random variable 
X, i.e., to Gamma-type distributions. 

2.2 The case 8 < 

Since 8 < and {x : q(x) > 0} must contain some interval it follows that the discriminant 
/3 2 — 4^7 of q must be strictly positive. If pi < p2 are the real roots of q we can write 
q(x) = 8(x — p\)(x — P2) so that the support of X is the finite interval 7(X) = (pi,p2). 
Now we show that for any choice of jU G (pi , P2) there exist a (unique) random variable X 
withX ~ IP(/i;g). To this end, it suffices to examine the particular case q(x) = — 8x(l —x) 
and < jU < 1 ; the general case is reduced to the particular one if we consider the random 
variable X = (X — pi)/(p2 — Pi)- Fixing /1 G (0, 1), q(x) = — 8x(l —x) and solving the 
differential equation (2.1) on 7(X) = (0, 1) we get 

fir) = r-M/S-lfl-r^-U-M)/*-! < X < 1 

that is, X ~ B(a,b) with a = n/\8\ > 0, b = (1 ~n)/\8\ > 0. It follows that the case 
8 < corresponds to a linear transformation of a Beta random variable, the Beta-type 
distributions. 

2.3 The case 8 > 

We have to further distinguish between the cases where the discriminant A = /3 2 — 48y is 
positive, zero or negative. 
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2.3.1 The subcase 8 > 0, A < 

Since q has no real roots, J(X) — R. Thus, jU G R can take any arbitrary value. Also, q 
has the form q{x) = 8(x — c) 2 + with 8 > 0, > and c G R. Without loss of generality 
we further assume that c = 0; otherwise we can consider the random variable X = X — c. 
Fixing /i G R, q(x) = Sx 2 + and solving (2.1) one finds that 

f(x) = pexpf tan" 1 (x a/570) ) , x G R. 

The normalizing constant C = C^(<5, 0) can be calculated explicitly when \x = 0: 

f(l + l/(2g))v^W 

r(1/2 + 1/(25))v ^ ■ 

Therefore, the quadratic polynomial q(x) = 8(x — c) 2 + with 8 > and > cor- 
responds to Student-type distributions centered at c, provided that /i = c; otherwise, i.e., 
when /! 7^ c, it corresponds to some asymmetric, say sfew Student-type, distributions. 



2.3.2 The subcase 8 > 0, A = 

Since q has a unique real root at p = —[5/(28), it follows that q(x) = 8(x — p) 2 and, 

therefore, the support J(X) is either (— °°,p) or (p,°°), according to /i < p or /i > p, 

respectively. Without loss of generality we may assume that q(x) = 8x 2 with 8 > and 

/i > 0; otherwise, it suffices to consider the random variable X = ^ _ p , (X — p). Now, 

IM PI 

setting J(X) = (0,°°), q(x) = Sx 2 (8 > 0) and /i > in eq. (2.1) we get the solution 

/(*) = YU x ~ a ~ le ~ X/x ^ * >0 ' 

where A = /i/<5 > and a=l + l/5>l. Observing that 1/X ~ r(a, A) it follows that 
the case 5 > 0, A = corresponds to Reciprocal Gamma-type distributions. 



2.3.3 The subcase 8 > 0, A > 

Assuming that pi < P2 are the roots of q we can write q(x) = 8(x — pi)(x — p?) and 
the support J(X) has to be either (— °°,pi) or (P2, 00 ), according to /i < pi or /i > P2, 
respectively. By considering the random variable X = — (X — pi) when /i < pi and the 
random variable X = X — p2 when /i > P2 it is easily seen that both cases reduce to Ji > 0, 
J(X) = (0,oo) and ^(x) = + 0) with 5 > and = p 2 - Pi > 0. Thus, there is no 
loss of generality in assuming ji > 0, J(X) = (0,°°) and q(x) = 8x(x + 0) with 5 > and 
>0. Then, (2.1) yields 

with a = 1 + ^ > 1 and £> = ^ > 0. Equivalently, ~ It follows that the case 

8 > 0, A > corresponds to Snedecor-type distributions. 

All the above possibilities are summarized in Table 2.1, below; compare with Table 2, 
p. 296, in Diaconis and Zabell (1991). 



Table 2.1: Densities of the Integrated Pearson family IP(/i; <5,/3, y) = IP(/i;g).* 



density f(x) support q(x) parameters mean u classification 

usual notation rule 

1. Normal-type , (*-m) 2 

_ „, 2 , — L=e R ff 2 y=c7 2 >0 AteR c5 = j3=0 

2. Gamma-type „ , , , x „ a _ , 

X~na t X) ^ X £ (0 ' +00) X a ' A> ° A >0 * = °'^° 

3 ' Beta T«. n (0,1) ^ «>*>° A>0 * = -l-<0 

X~B(a,b) B ( a ' b > a + b a + b a + b 

c r , m -yfry) \ 8>Q 



4. Student-type Lexp l 2 TO *2 s n td 0>U 

1 - t~i — - H 5x 2 + r 5,y>0 jueia i3 2 <4<5y 



5. Reciprocal X c , _i x 2 A /3 2 =4<5y 

„ t4tI " 'e » (0,+°°) «>1,A>0 ->0 , p 

Gamma-type r <") v ' a-\ a-\ ^~T(a,A) 



6. Snedecor-type , w „,_ a _ 6 , . x(x+9) a >\, b9 j3 2 >4<5y 



A random variable X belongs to the Integrated Pearson family if and only if there exist constants C[ ^ and ci £ R such that the density of X = c\X + ci_ is contained in the table. 
For n > 1 and if 11 = and 5 = ^ = y - then X ~ f„ . 

2 c = c„(s,r) > 0, with c (5,r) =r (1 + v/sr^ /r (| + 0?. 



00 
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Remark 2.1. Since 

exp (^^(V^Tr)) d exp (^ tan_1(x ^7^ ) ) 
(/i x) (5jc 2 +r) 1+ ^ ^ (5jc 2 +r)2? 

it follows that EX = /! for the Student-type densities (type 4), while for all other cases it 
is evident to check that the mean is as displayed in Table 2.1. Next, it is easily verified 
that the densities of Table 2.1 satisfy the assumptions (B) of Proposition 3.3, below, with 
/l = EX, pi{x) = q(x) and p\(x) = /i — x — q'(x), where jj, and q are as in the Table. 
Hence, according to Proposition 3.3, all these densities are, indeed, integrated Pearson. 

Corollary 2.2. Assume thatX ~ IP(jU;5,/3,7). 

(a) If 5 < then E|X| a < °o for any a e [0,°°). 

(b) If 5 >0thenE|X| a <ooforany ae [0, 1 + 1/5), while E|X| 1+1 / 5 = °°. 

Proof. If X ~ IP(/i; 5,/3, 7) then we can find constants c\ ^ and C2 G R such that the 
density of X = c\X + C2 is contained in Table 2.1. Then, according to Proposition 2.1(vi), 
X ~ IP(/i; <5,/3, 7) with 5 = 5. The assertion follows from the fact that E|X| a < °° if and 
only if E\aX + c 2 \ a < °°. □ 

Next, we shall obtain a recurrence for the moments and the central moments of a 
random variable X ~ IP(/i;g). To this end we first prove a simple lemma. 

Lemma 2.1. If X ~ IP(/i;<5,/3,7) has support J(X) = (a, on) and E\X\" < °o for some 
n ^ 1 (that is, 5 < ^) then 

lim x k q(x)f(x) = lim x k q(x)f(x) = 0, = 0, 1, . . . ,n- 1, (2.2) 

and, in general, for any c G R, 

lim (x-cfq{x)f{x) = lim (x-c) k q(x)f(x) = 0, = 0, 1, . . . ,n - 1. (2.3) 

Proo/ Since x k q(x)f(x) = x k -t)f(t)dt, a < x < a), the second limit in (2.2) is 
trivial whenever a > — °° and the first one is trivial whenever 00 < °°. If (0 = °° it suffices 
to verify the first limit in (2.2) only when k = n — 1 and n ^ 2 (because the case = 
is obvious); then, since q(x)f(x) is eventually decreasing we have that for large enough 

x > 0, 

x- l q(x)f(x) = q(x)f(x) {n ~_ ^ - f f - 2 df 

2"-!-l A/2 yw7W 

— D2' 7-1 f 00 
^ ,„_! 1 /, f"" 2 9(0/(0*-^0, asx^oo, 

1 >/x/2 
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because deg(g) ^ 2 and, by assumption, ~Eq(X) \X\"~ 2 < °°. The case a = — °° is translated 
to the previous one by considering the random variable X = —X with density f(x) = 
/(— x), quadratic polynomial q(x) = q(—x) and support J(X) = (ct,G>) = (— £0, — <x) = 
(-Q),~). Then E|X|" = E|X|" < oo and 

lim x k q(x)f(x) = (-l) k lim x k q(-x)f(-x) = (-l) k KmJq(x)f(x) = 

X— >— oo X— >oo X— >oo 

for all e {0, 1, ... ,n — 1}. Now it suffices to observe that all limits in (2.3) are linear 
combinations of limits in (2.2). Indeed, the first limit in (2.3) is 

\im x ^ co (x-c) k q(x)f(x)=L- =0 ())(-c) k -'\hn x ^ a) x'q(x)f(x)=0 

and, similarly, the second limit in (2.3) is 

hm xSia (x-c) k q(x)f(x) =Ef =0 ©(-c) fe - ; lim xNa x^(x)/(x) = 0. □ 

Lemma 2.2. If X ~ IP(/i; 5, J3, y) and E|X|' 1 < oo for some n ^ 2 (that is, 5 < ^) then 
for any c E R, the central moments about c satisfy the recurrence 

Mx _ c) k+l = (V-c + kq>(c))E(X-c) k + kq(c)E(X-c) k - 1 

[ ' l-k8 ' (2.4) 

k = 1,2, . . . ,n— 1, 



with initial conditions E(X — c)° = 1, E(X — c) 1 = /i — c, where <?(c) = 8c 2 + /3c + y, 
</(c) = 25c + /3 . In particular, 

(i) the usual moments (c = 0) satisfy the recurrence 

Hltl=M H^ t=u (25) 

with initial conditions EX = 1 and EX 1 = /i; 

(ii) the central moments (c = /i) satisfy the recurrence 

E(x _ M = k^)nx-^ + kc,wnx-^-^ (2 . 6) 

1 — 

with initial conditions E(X - ju)° = 1 and E(X - fx) 1 = 0. 
Proof. If J(X) = (or, a)) is the support of X and k E {1,2, ...,n— 1} then we have 
E(X - c) k+l = E[((ju -c) - Qi -X))(X - c)*] 

f£0 



(Ai-c)E(X-c) A '- / (x-c^Gu -jc)/(x)cU. 
ia 
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Using (2.3) and the fact that q(X) = S(X - c) 2 + q'{c){X -c) + q(c) we see that 

{■CO /-CO 

- ( x - c ) k (n-x)f(x)dx=- (x-c) k (q(x)f(x)Ydx 
J a J a 

= -(x- c) k q(x)f(x) 1 1 + kEq(X)(X - c)^ 1 

= k8E(X - c) k+l + kq'{c)E{X - c) k + kq(c)E(X - c) k ~\ 

Therefore, 

( 1 - k8)E(X - c) k+l = (ft - c + kq'(c))E(X - c) k + kq(c)E(X - c)^ 1 , 

k = 1,2, ... ,n — 1, 

and, since the initial conditions are obvious, (2.4) follows. □ 



3 Comparison with the ordinary Pearson system 

The ordinary Pearson family consists of absolutely continuous random variables X sup- 
ported in some (open) interval (a, ft)), such that their density /, which is assumed strictly 
positive and differentiable in (a, ft)), satisfies the Pearson differential equation 

§4 = £iM, a<x<a , (3.1) 

f[x) P2{X) 

where p\ is a polynomial of degree at most one and P2 is a polynomial of degree at 
most two. Since we can multiply the nominator and the denominator of (3.1) by the 
same nonzero constant, it is usually assumed, for convenience, that p\ is a monic linear 
polynomial of degree one, e.g., p\(x) = x + aQ. Although this restriction specifies both 
pi and p2 whenever p\ is non-constant, it is not satisfactory for our purposes because it 
eliminates all rectangular (uniform over some interval) distributions and several B(a,b) 
densities (those with a + b = 2) - see Table 2.1, above. Therefore, when we say that a 
function / satisfies the Pearson differential equation (3.1) it will be assumed that p\ is 
any polynomial of degree at most one (the cases p\ = and p\ = c ^ are allowed) and 
P2 ^ is any polynomial of degree at most two. Note that common zeros of p\ and p2 
are allowed inside the interval (a, ft)). Also, it may happen that p\ and p2 have common 
zeros outside the interval (a, ft)); this is the case of an exponential density. 

Clearly, the ordinary Pearson family contains some random variables whose expecta- 
tion does not exist, e.g., Cauchy. Sometimes it is asserted that, under finiteness of the first 
moment, (1.1) and (3.1) are equivalent - see, e.g., Korwar (1991), pp. 292-293. However, 
this is true only in particular cases, i.e. when we have made the 'correct' choice of p2 and 
provided that a solution / of (3.1) is considered in a maximal subinterval of the support of 
P2, {x : P2(x) 7^ 0}. The following algorithmic procedure will always decides correctly if 
a given Pearson density belongs to the Integrated Pearson family. The algorithm makes a 
correct choice of p 2 , if it exists, as follows: 
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The Integrated Pearson Algorithm 

Step 0. Assume that a Pearson density / with finite (unknown) mean and (known) support 
S(f) = {x : f(x) > 0} = (a, ft)) satisfies f'/f = p\/pi for given (real) polynomials 
pi, P2 (with p2 ^ 0), of degree at most one and two, respectively. 

Step 1. Cancel the common factors of p\ and p2, if any. Then the resulting polynomials, 
say p^ and p^\ have become irreducible - they do not have any common zeros in 
C. In case p\ = it suffices to define pr^ = 0, p^ = 1- 

Step 2. If a > —°° and p£ {&) ^ then multiply both p~^ and p^ by x — a and name 

~(2) ~(2) 

the resulting polynomials p\ ' and p^ , otherwise (i.e. if either a = —°° or a > —°° 
and p£' (a) = 0) set pf 1 = p^ and p^ = p^ ■ 

Step 3. If (0 < oo and p^\(j}) ^ then multiply both p^ and pip by ft) — x and name 
the resulting polynomials p\ and p2', otherwise (i.e. if either ft) = °° or oo < °° and 

A 2 )/ \ m * ~(2) , ~(2) 

j!?2 \CO) = 0) set pi = p\ ma P2 = p 2 ■ 

Step 4. If the resulting polynomials p\ and /?2 satisfy the conditions deg(pi) ^ 1 and 
deg(j!72) ^ 2 then p2 is a correct choice and / ~ IP(/i;g) with ^(jc) = 0p2(x) for 
some 0^0; otherwise the given density / does not belong to the Integrated Pearson 
system. 

It is clear that the above procedure starts with the equation f'/f = p\/p2 and, at 
Step 3, it produces two new (real) polynomials pi,p%, of degree at most three and four, 
respectively, such that /'// = p\j P2- Moreover, the polynomial p2 satisfies the relations 
P2{oc) — if a > -oo, P2(g>) = if £0< oo and pi{x) ^ for all x G (a, ft)). Furthermore, 
because of Step 1, the polynomials p\{z) and P2(z) cannot have any common zeros in 
C \ {a, co}. 

The algorithm guarantees that we have chosen a correct p2 in each case where such a 
P2 exists. For example, the standard exponential density, 

f(x) = e- x , x>0, 

satisfies (3.1) when (pi,p2) = ( — 1 , 1 ) , when {px^pi) = (—x,x) and when (j)\,pi) = 
(— x— l,x+ 1); the correct choice is the second one. The standard uniform density, 

/(x) = l, 0<*<1, 

satisfies (3.1) for p\ =0 and for any p2 (with no roots in (0, 1)), and the correct choice is 
P2 = x( 1 — x) . The power density, 



f(x) = 2jc, < x < 1 , 
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satisfies (3.1) with (pi, pi) = (2 — x,x(2 — x)) and the correct choice arises when we mul- 
tiply both polynomials by (1 — x)/(2 — x), that is, (p\,P2) = (1 — x,x(l — x)). The Pareto 
density, 

2 

(x+l) s 

satisfies (3.1) when (p\,P2) = (—3,x + 1), when (p\,pi) = (— 3x,x(x + 1)) and when 
{p\,Pi) = (— 3(x + 1), (jc+ l) 2 ); the correct choice is the second one. The half-Normal 
density, 



satisfies (3.1) in its interval support (a, to) = (0,°o), although it does not satisfy (1.1) - 
there not exists a correct choice for p2- A more natural example is as follows: Consider 
the density 

fix) = —p , a<x<co, 

Vl+x 1 

where C = C(a, CO) > is the normalizing constant. This density satisfies, in any finite 
interval (a, co), the Pearson differential equation (3.1) with p\ = —x, pi = \ +x 2 , while 
its integral over unbounded intervals diverges. This density does not fulfill (1.1) and thus, 
it does not belong to the Integrated Pearson family - again there does not exist a correct 
choice for p2. 

The algorithm is justified because of the following propositions. 

Proposition 3.1. Let X ~ / and assume that the density / satisfies the assumptions of 
Step 0. If X ~ IP(/i;g) then the polynomials p\ and p2 of Step 3 are of degree at most one 
and two, respectively, and q(x) = 0p2(x) for some 0^0. 

Proof. Since X is Integrated Pearson, Y = XX + c is also Integrated Pearson for all A ^ 
and c e R; see Proposition 2.1(vi). Also, its density fy(x) = m/(^jp) satisfies, by 
assumption, the differential equation 

fy(x) _Pi(x) „ ~y, N _,~ fx-c\ ~Y ( ,_' l 2~( X - C 



fr(x) p\ 



x 



x e (a, co), with p\ (x) = Xp l , p\ (x) = X l p 2 



where (a,co) = (Xa + c,Xco + c) or (Xco + c,Xa + c), according to X > or X < 0, 
respectively. It is easily shown that the new polynomials p\,p2 (those that the algorithm 
produces at Step 3 for f) are related to the corresponding polynomials p\, p\ (those that 
the algorithm produces at Step 3 for fy) by the relationships 

p\(x) = X i p 1 (^-\ p Y 2 (x) = X i+1 p 2 

for some i E {1,2,3}. Therefore, it suffices to show that deg(pf ) ^ i, i = 1,2, and that 
the quadratic polynomial qy(x) = X 2 q(^) of Y is related to p\ through qy(x) = Qp\(x) 
for some 6^0. Thus, without any loss of generality we may assume that / is one of the 
densities given in Table 2.1. 
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Now observe that (pi , P2) is always irreducible for types 1,4,5 (Normal-type, Student- 
type, Reciprocal Gamma-type) with deg(pi) = 1 for all types 1,4,5, while deg(p~2) = 
for type 1 and deg(p~2) = 2 for types 4 and 5. Since the corresponding supports are R, IR 
and (0,°°), respectively, and since in type 5, pi{x) = Ox 2 for some ^ 0, it follows that 
{p\-,Pi) = {p\iPi), <1= 0p2 for some 0^0, and the assertion follows. 

For types 2, 3 and 6 (Gamma-type, Beta-type and Snedecor-type) the irreducibility of 
p\ and p2 depends on the parameters. Let us see these cases separately. 

If /~r(a, A) with a ^ 1 (a > 0, X > 0)then p\ — 0(a—l — Xx) and p~2 = Oxfor some 
^ 0, so that p\,P2 are irreducible with degree one. It follows that p, = pi, deg(p ; ) = 1 
(i = 1 , 2) and 

/ \ x p 2 (x) 



x ex ' 

If / ~r(l,A.) (X > 0) then all possible choices for {p\,pi) are given by p\ = — X6(x + c) 
and p2 — 9(x + c) for ^ 0, c E R. Therefore, Step 3 yields (pi,p 2 ) = (— A0x, Ox) and, 
thus, deg(pi) = 1 (i = 1,2) and 

x p 2 {x) 

q{x) = x = ^e- 

If / is of type 6 and b^\ then 

(p\(x),p 2 (x)) — (c((b— 1) — (a + l)x),cx(x + d)) for some c ^ 0; 

here the parameters are a,Z?, with a > I, b > and > 0. It follows that (pi,p 2 ) = 
(pi,p 2 ), deg(pj) = i (i =1,2) and 

q (x)- X{x+0) - P1{X) 



a — I (a — l)c 

If / is of type 6 with £ = 1 then all possible choices for (pi,P2) are given by 

p\{x) = -c(a+ l)(x+y) and p 2 (x) = c(x+ B)(x + y) for some c ^ 0, ye R. 

Therefore, Step 3 yields (pi,P2) = (—c(a+l)x 1 cx(x + 0)) and, thus, deg(p ; ) = i (i =1,2) 
and 

_ 4x+e) _ P2 (x) 

a — 1 (a — ljc 

Finally, let / be of type 3 (Beta-type), that is, / ~ B(a,b) with a, b > 0. If a ^ 1 and 
Z? ^ 1 it is easily shown that 

(pi(jc),^)) = (0(a-l-(fl+i-2)jc),0j:(l-jc)) (0^0) 

are irreducible, so that {pi,pi) = (pi,P2), deg(p,) = i (i =1,2) and 

x(l-x) p 2 (x) 



q(x) 



a + b (a + b)9' 
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If a = 1, b ^ 1, the most general form of (pi,p 2 ) is given by 

(pi(x),p 2 (x)) = (-(b-l)d(x + c),d(l -x)(x + c)), where 6/0,cGR. 

Therefore, Step 3 yields (j>\,p2) = (—(b — l)9x, 6x(l — x)) and, thus, deg(p,-) = / (/ = 1,2) 
and 

b+1 ~ (b+l)0- 

If a 7^ 1, b = 1, the most general form of {p\,pi) is given by 

(pi(x),p2(x)) = ((a-l)0(x+c),6x(x+c)), where ^ 0, c e E. 

Therefore, Step 3 yields {px.pi) = — 1)0(1 — x),0#(l — jc)) and, thus, deg(p,-) = z 
(i = 1 , 2) and 

JC(1 — JC) P2(x) 
9W fl+1 (fl+l)0" 

Finally, if a = = 1 (standard uniform density, C/(0, 1) = B(l, 1)) then pi = so that 
(pi,P2) = (0,x(l -x)), deg(pi) < 0, deg(p 2 ) = 2 and 

x(l-x) p 2 (x) 

?w = 



2 2 

This subsumes all cases and completes the proof. □ 

Proposition 3.2. Assume that X ~ / where the density / is differentiable with deriva- 
tive /' in its (known) interval support (a,co) and has finite (unknown) mean. Then, the 
following are equivalent: 

(A) / satisfies (3.1) for some (real) polynomials p\ (of degree at most one) and pi^O 
(of degree at most two) with P2{a) = if a > — °°, p 2 ( ft>) = if ft>< °° and p% (x) ^ 
for all xe (a, ft)). 

(B) X ~ lF(n;q) for some #(x) = 8x 2 + j3x + y with {x : q(x) > 0} = (a, ft)) and some 
/i g (a, ft)). 

Moreover, if (A) and (B) hold, then there exists a constant 6 =£0 such that q(x) = 8p2(x), 
jcgR. 

Proof. Assume first that (B) holds. Since / ~ IP(/i;g), (2.1) shows that f'/f = p\/p2 
where p\ = /I — x — q' and p2 = q. Putting the polynomials p\ = /I — x — q' and p 2 = q 
in Step of the above algorithm and using Proposition 3.1 we conclude that the resulting 
polynomials p\ and P2 (of Step 3) satisfy the requirements of (A); also, q(x) = 0p2(x) for 
some 0^0. 

Assume now that (A) holds. Using a suitable mapping Y = XX +c, A ^ 0, c 6 R, we 
can transform the interval (a, ft)) into (a, ft)), where (a, ft)) is one of the intervals (0, 1), 
(0,°o) or (—00,00). The polynomials p\ and p 2 are transformed to p\{x) = Xp\{^-) and 
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p\ (x) = X 2 p2(^j £ ), and the differential equation (3.1) yields f Y {x)/ fy{x) = p\{x)jp\ (x), 
a < x < CO, where fy is the density of Y and (5, co) its support. Moreover, it is easy to 
see that p\ and p\ satisfy the requirements of (A), i.e., p\ (oc) = if a > — °°, p\ (co) = 
if CO < 00 and p\ (x) 7^ for all x E (S, co). Clearly, in view of Proposition 2.1(vi), it 
suffices to verify that Y is Integrated Pearson. Thus, from now on (and without any loss of 
generality) we shall assume that (a, co) is one of the intervals (0, 1), (0,°°) or R. 

If (a,co) = (0,1) then the assumptions (A) show that piix) = 6x(l — x) for some 
0^0. Let p\{x) = ao + a\x. Solving (3.1) we get 

f(x) = Cx a °/ e (l -*)-( a o+ a i)/ e , < x < 1, 

where, necessarily, 1 + ao/0 > and 1 — (ao + a\)/9 > 0. Thus, 

(l+a o /0) + (l-(ao + ai)/0) = (20-ai)/0 >0, 

so that 20 -a\ ^ 0. It follows that / ~ B(a,b) with a = 1 + a /6, b = l- (a + ai)/G 
and, therefore, 

= x ( l ~ x ) _ -x) = p 2 (x) 
q[X) a + b 2-ai/O 2Q-a{ 

Assume that (a, co) — (0,°°). Then, assumptions (A) show that the possible forms of 
P2 are either p 2 = Ox or p 2 = 6x 2 or p 2 = 6x(x + c) for some 6^0 and c > 0. If P2 = Ox 
set p\ = ao + a\x and solve (3.1) to obtain 

f(x) = Cjc ao/e exp(aix/0), x > 0, 

where, necessarily, ao/6 > —1 and a\/Q < 0; thus, X ~ r(a, A) with a = ^ — 1 > and 
A = — ^ > 0. Therefore, a\ 7^ and 

?W = T 



A — ai 

If /»2 = $* 2 > set pi = ao + aix and solve (3.1) to obtain 

f(x) = Cx ai/e exp(-a /(6x)), jc> 0, 

where, necessarily, ao/$ > and a\/9 < —2; these conditions are necessary and sufficient 
for Jq f(x)dx and f^xf(x)dx to be finite. Therefore, f(x) = Cx~ a ~^e~^' x , x > 0, where 
a = — 1 — ^->1 and A = ^ > 0. Observe now that / is of Reciprocal Gamma type 

(type 5) and q(x) = Since a = -1 - ^ > 1 it follows that ~ a '- 2e > and, finally, 
ai+26 ^0. Thus, 

, s _ * 2 0x 2 
^ W ~~ a-\ ~ -a x -26 ~ -a x -2Q' 

Assume now that P2 = 6x(x + c), 6 7^ 0, c > and let p\ = ao + a\x. Solving (3.1) we 
obtain 

a o a l c ~ a 

f(x) = Cx^(x + c)~^~ t x>0, 
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where, necessarily, ^ > — 1 and ^ < —2; these conditions are necessary and sufficient for 
Jq f(x)dx and jQxf(x)dx to be finite. Now observe that f(x) = Cx b ~ l (x + c)~ a ~ b (x > 0) 
is of Snedecor-type (type 6) with a = — — 1>1 and b = 1 + ^ > 0. From ^ < -2 we 
get ai + 26 ^ and, thus, we conclude that (see Table 2.1) 

x(x + c) x(x + c) 6x(x + c) P2(x) 

q(x) 



a— I —2 — ai/d —a\—29 —a\ — 2Q 

Finally, assume that (a, go) = R. In this case assumptions (A) imply that either p2 = 
e ^ or p 2 = ±(6(x-c) 2 + X) with > 0, A > and c e R. Assume first that p 2 = ^ 
and let pi = ao + a\x. Then, it is easily seen from (3.1) that 

/W=Cexp(|ix 2 + ^x), xeR. 

This can represents a density if and only if ^ < 0; in this case it is easily seen that 
/ ~ N{n, a 2 ) with n = ^i,o = and thus, 

q(x) = a = = — — . 

a\ —a\ 

For the last remaining case it suffices to consider 

Pi{x) = 0(x — c) 2 + X and p\(x) =ao + ai(x — c) where 9 > 0, X > and ao,ai,c e R. 

Also, using the transformation X i-tX — c, the general case is simplified to p2 = Ox 2 + X 
and p\ = ao + a\x. Now, the differential equation (3.1) has the general solution 



fix) =C(0JC 2 +A)2eexp 



go 

/el 



tan- 1 (jc v / 0M) 



The necessary and sufficient condition for this / to represent a density with finite mean is 
— — 1 > or, equivalently, a\ + 26 < 0. Therefore, setting 

9 X an m 
> 0, 7= — > and ju = ^— e R 



-ai-20 ' -ai-20 -fli-20 

we see that this is a Student-type density (type 4); see Table 2.1. Consequently, 



q(x) = 8x + J 



2 , 9x 2 + X p 2 (x) 



—a\—2Q —a\—2d 
and the proof is complete. □ 

Eventually, Proposition 3.2 says that for a particular choice of p 2 to be correct it is 
necessary and sufficient that p 2 remains nonzero in (cu,co) and vanishes at all (if any) 
finite endpoints of (a, co). 

If the mean /i is known, then another simple criterion for an ordinary Pearson variate 
to belong to the Integrated Pearson family is provided by the following proposition. 
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Proposition 3.3. Let X be a random variable with density / and finite mean /!. Assume 
that the set {x : f(x) > 0} is the (bounded or unbounded) interval J(X) = (oc,G)) and that 
/ is differentiable in (a, CO) with derivative f'(x), a < x < CO. Then the following are 
equivalent: 

(A) X~w(ji',q). 

(B) The density / satisfies (3.1) and the polynomials p\ (p\ = is allowed) and p 2 can 
be chosen in such a way that (i) and (ii), below, hold: 

(i) there exist a constant 0^0 such that p\ (x) + p 2 (x) = (jU — x) /&, jteR, and 

(ii) either lim^ a p 2 (x)f(x) = or lim^ ffl p 2 (x)f(x) = 0. 

If (i) and (ii) are true then the polynomials p 2 and q are related through q{x) = 9p 2 (x) 
where ^ is as in (i). Moreover, if (3.1) is satisfied in an unbounded interval (a, CO) 
then (ii) is unnecessary since it is implied by (i). 

Proof. If X ~ IP(/i;g) then we see from (2.1) that (3.1) is satisfied for the polynomials 
p\{x) = fl —x — q'(x) and pi{x) = q(x). With this choice of p\, p 2 , Proposition 2.1 shows 
that (i) (with 9 = 1) is valid. Also, (ii) reduces to p 2 (x)f(x) = q(x)f(x) — > as x /* CO 
or x \ a; this follows by an obvious application of dominated convergence since the 
mean exists and, by assumption, p 2 (x)f(x) = q(x)f(x) = J«(jU — t)f(t)dt - see (1.1). 
Conversely, (3.1) and (i) imply that [Qp 2 {t)f{t)}' = (/i-t)f(t), a <t < CO. Integrating 
this equation over the interval [jc,y] C (a, Co) we get 

f\n - t)f(t)& = e P2 (y)f(y) - 0p2(x)f(x), a<x<y<co. (3.2) 

Now, let us take into account the first assumption in (ii), \im x \ j(X p 2 (x)f(x) = 0. Taking 
limits in (3.2) and using dominated convergence for the l.h.s. we conclude that 

f (n-t)f(t)dt=0 P2 (y)f(y), a<y<co; 

J a 

that is, X ~ IP(/i;g) with q(x) = 6p 2 (x). Clearly we get the same conclusion if we use the 
second assumption in (ii), \im y/ A (0 p2(y)f(y) = 0, and evaluate the limits as y /* CO in (3.2); 
in this case we get the identity f®(t - pi)f(t)dt = 6p 2 (x)f(x) = q(x)f(x), a < x < CO, 
which is equivalent to (1.1), since /^(/i — t)f(t)dt = 0. 

It is clear that, in the presence of (i), both assumptions in (ii) are equivalent. In fact, 
(3.2) shows that both limits yvm y/ A (S) p 2 (y)f(y) and lim x \ ia p 2 (x)f(x) exist (in R) and are 
equal. Indeed, 

0p2(y)f(y) = P2 (x)f(x) + f (ji - 0/(0*, a<x<y<co, 

Jx 

and the existence of the first moment implies that, as y CO, the r.h.s. has the well-defined 
finite limit C(x) = 6p 2 (x)f(x) + — t)f(t)dt; the l.h.s, however, is independent of x 
and, certainly, the same is true for its limit, so that C(x) = C. In other words, 

6p 2 (x)f(x) =C+ (t~n)f(t)dt, a<x<co, 

Jx 
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and since lim^ f®(t - p)f(t)dt = f„(t — p)f(t)dt = we conclude that 

C 

lim p 2 (x)f(x) = lim p 2 {y)f{y) = - G E. 

It remains to verify that if (3.1) holds in an unbounded interval (a, co) and X has finite 
first moment then (i) implies (ii). To this end assume that co = °° so that J(X) = (oc,°°) 
with a G [—oo,°o). It follows that f'(x) = pi(x)f(x) / p 2 (x) does not change sign for large 
enough x, and thus, f'(x) < for x > xq. Therefore, for x > max{2xo, 0}, 

8 f x 8 f x 8 t'°° 

< 3? f{x) = -f{x) / tdt<- tf(t)dt < - / tf(t)dt 0, 

3 Jx/2 3 Jx/2 3 Jx/2 

as x — > oo 5 i.e. f(x) = o(x~ 2 ) as x — > 00. Thus, pi{x)f{x) — > as x — > 00. The case a = —00 
is similar and the proof is complete. □ 



4 Are the Rodrigues-type polynomials orthogonal in the ordinary 
Pearson system? 

Associated with any Pearson density / is a (unique) sequence of polynomials, defined by a 
Rodrigues-type formula. Actually, these polynomials are by-products of the pair (pi,P2) 
that appears in the nominator and the denominator of the differential equation (3.1); that 
is, they have nothing to do either with / or with the interval (a, 00). 

These considerations will become more clear if we slightly relax the form of differen- 
tial equation (3.1) and permit more solutions, as follows: 

Definition 4. 1 . Let ^ (a, co) C R, and consider a pair of real polynomials (pi, pi) = 
(ao + aix : bo + b\x + b2X 2 ) such that P2^0 (i.e., \bo\ + \bi\ + > 0). The pair (pi,P2) is 
called Pearson-compatible in (a, co), or simply compatible, if there exists a differentiable 
function / : (a, co) — > R, / ^ (/ is not assumed nonnegative or integrable), such that the 
following generalized Pearson differential equation is fulfilled: 

p 2 (x)f(x)=p l (x)f(x), a<x<co. (4.1) 

In other words, (pi,P2) is compatible if (4.1) has non-trivial solutions for /. 

It is easily seen that {p\,P2) is compatible whenever p 2 has no roots in (a, co); in this 
case, the general solution / is C TC (a, co) and can be chosen to be strictly positive in (a, co). 
The presence of a zero of p% in (a, co), however, may results in incompatibility; e.g., in 
the interval (a, co) = (—2,2) the pair {p\,P2) = (4x,x 2 — 1) is compatible, in contrast to 
the pair (p\,p2) = (x,x 2 -\). 

If {p\,P2) is compatible in (a, co) then we can find the general solution as follows: 
First we solve (4.1) separately in any open subinterval of (a, co) R {x : p 2 {x) 7^ 0}; clearly, 
there are at most three subintervals and, in the worst case, the three general solutions for 
the distinct intervals (Ji,J 2 ,J3) = ((ce,pi), {p\,p 2 ),{p 2 , co)) will be of the form f = Cie 8i 
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for some gj £ C°°(7,), z = 1,2,3, with C, being arbitrary constants. Next, we match the 
solutions and their first derivatives at the common endpoints of any two 7 ; ; any such point 
is, necessarily, a zero of p2- The compatibility of (pi,P2) guarantees that this procedure 
will success in producing some solution / ^ (in which case, |/| ^ will be also a 
non-trivial solution), but it may happen that fj = in some The following proposition 
describes all possible cases for the support of /. 

Proposition 4.1. Assume that the function / : (a, co) — > E, / ^ (not necessarily 
positive or integrable) is differentiable in (a, ft)) and satisfies the differentiable equa- 
tion (4.1) for some real polynomials pi(x) = ao + a\x and pi{x) = bo -\-b\x-\-b2x 2 with 
\bo\ + \h\ + \b 2 \ > 0. Then, the support of /, S(f) := {x £ (a, ft)) : f(x) ^ 0}, is either of 
the form (a, ft)) C (a,(o) with a < a < ft) ^ ft), or of the form (a, pi) U (p2, ft)) C (a, co) 
with a ^ a < pi ^ P2 < ft) ^ ft), or, finally, of the form (a,pi) U (pi,P2) U (p2, ft)), with 
a < pi < P2 < ft). Moreover, the boundary of S(f) is contained in the set {a, ft)} U {jc £ 
(a, ft)) : = 0}, that is, dS(f) C {a, co} U {jc £ (a, ft)) : p2 W = 0}. Finally, for any 
solution /, f(p) = (that is, p ^ whenever p is a zero of p 2 which is not a zero of 

Pi- 

Corollary 4.1. The differential equation (4.1) has a nontrivial and nonnegative solu- 
tion if and only if the pair (pi,p2) is compatible in (a, ft)). Moreover, assuming that 
ip\iP2) is compatible in (a, ft)), it follows that: 

(a) any nonnegative solution is of the form |/| for some solution /; 

(b) the support S(f) = {x £ (a, ft)) : f(x) ^ 0} of any nontrivial solution / of (4.1) is a 
union of one, two or three disjoint open intervals of positive length, and the same is 
true for any nonnegative and nontrivial solution; 

(c) the boundary points of S(f) = S(\f\) of any nontrivial solution / of (4.1) are either 
roots of p2 or boundary points of (a, ft)). 

We now turn to the corresponding Rodrigues polynomials. It is well-known that 
the (generalized) Pearson differential equation (4.1) produces a sequence of polynomi- 
als {hk^k = 1,2, . . .}, defined by a Rodrigues-type formula, as follows: 

Theorem 4.1 (Hildebrandt (1931), p. 401; Beale (1941), pp. 99-100; Diaconis and 
Zabell (1991), p. 295). Assume that a function / : (a, co) — > R, (not necessarily positive 
or integrable) does not vanish identically in (a, ft)) and satisfies the differential equa- 
tion (4.1) for some polynomials p\{x) = ao + aix and p 2 (x) = bo + b\x + b2X 2 , with 
|^o| + |^i I + \b 2 \ > 0. Then, the set {x £ (a, co) : f(x) ^ 0} contains some interval of 
positive length and the function 

h(x) := -^^-[p k 2 (x)f(x)i x£(a,ft))\{x:/(x) = 0}, /c = 0,l,2,... (4.2) 
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is a polynomial (more precisely, hk is the restriction in (a, CO) \ {x : /(x) = 0} of a poly- 
nomial hk : R — > R) with 

deg(%)<fc and lead (A*) = [(ai+jb 2 ), it = 0,1, 2,..., (4.3) 

y=A:+l 

where lead (hk) := lim^oo A)t(jc) /jr denotes the coefficient of x k in hk(x). 

Hildebrandt (1931) actually showed that the relation pif = P\f implies that D k \p\f \ = 
hkf, k = 0,1,2,..., where the polynomials hk (with deg(%) ^ k) are defined inductively. 
Each polynomial hk can be viewed as the value of a functional Mk that maps any pair 
( P \,pi) to a real polynomial of degree at most k. The form of this functional is 

(puP2)^mPi,P2):=hk = ^C a k }^( Pl ) r ( P ' 2 )\ P2 y 

r,i,j 

where the sum ranges over all integers r, j ^ with r+ i + 2j ^ k, and the constant C^j 
depends only on k, r, i, j,p\ = a\ and p 2 ' = 2b 2 . On the other hand it is clear that, given an 
arbitrary pair {p\,P2) with p 2 ^ 0, we can fix an interval (a, (o) containing no roots of p 2 . 
With the help of a positive solution / of the differential equation (4.1) we can determine 
hk(x), a < x < CO, using the Rodrigues-type formula (4.2). Obviously, this hk extends 
uniquely to h k . 

To give an idea about the nature of the polynomials in (4.2) we expand the first four: 
ho = 1; 

h\ =Pi+P 2 = (ai+2b 2 )x+(a + bi); 
h 2 = p\ + 3p 1 p' 2 + p\p 2 + 2p 2 p 2 + 2(p 2 ) 2 

= (fli + 3b 2 ) (a\ + Ab 2 )x 2 + 2(a + 2b\ ) {a\ + 3b 2 )x 
+ (a + b l )(ao + 2bi)+bo(ai+4b 2 ); 

h 3 = pl + 6plp 2 + 3pipiP2 + %P\P2P2 + 1 lpi (p'2) 2 + 7p'\P2P2 + I&P2P2P2 + 6 (P2) 3 
= (fli + 4b 2 ) (fli + 5b 2 ) (fli + 6b 2 )x 3 + 3 (a + 3b\ ) (a\ + 4b 2 ) (a\ + 5b 2 )x 2 
+ 3 (oi + 4b 2 ) [(flo + 2bi ) Oo + 3b l )+b (a l + 6b 2 )]x 
+ a 3 + 6albi + a [1 lb\ + b Q {3a x + 16b 2 )] + b\ [6b\ + b (lai + 36b 2 )]. 

Provided that the solution / of (4.1) is a probability density in (a, ft)), the polynomials 
hk are candidate to form an orthogonal system for /. Indeed, Hildebrandt (1931), pp. 
404-405, showed that each % satisfies a specific second order differential equation in 
(a, ft)). Using this differential equation Diaconis and Zabell (1991) proved that the hk 
are eigenfunctions of a particular self-adjoint, second order Sturm-Liouville differential 
equation; thus, their orthogonality with respect to the density / is a consequence of the 
Sturm-Liouville theory. Specifically, it is shown in Theorem 1 of [9] (see p. 295) that each 
polynomial hk satisfies the equation 

[f(x) P2 (x)hk(x)]' = k(a 1 + (k+l)b 2 )f(x)h k (x), a < x < ft), k = 0,1,2,... . (4.4) 



22 G. Afendras, N. Papadatos 



An adaption of the Diaconis-Zabell approach to the present general case reveals that the 
orthogonality is valid only when a number of regularity conditions is satisfied. It will 
be proved here that these regularity conditions consist of an equivalent definition of the 
Integrated Pearson system. In fact, it will be shown that the Rodrigues polynomials (4.2) 
are orthogonal with respect to the corresponding density / if and only if this / belongs 
to Integrated Pearson family, provided that we have chosen a correct p2 in the differential 
equation (4.1), i.e. provided that p2 = q/9 for some 6^0. We mention here that, even 
for Integrated Pearson densities, a wrong choice of p2 results in non-orthogonality of the 
Rodrigues polynomials; see, e.g., the polynomials h k = P? given in [9], p. 297, for the 
Beta-type density f(x) = Cx N , < x < xq. In light of Proposition 3.2 (and Table 2.1), a 
correct choice for this density is given by p2 = x(xq — x) . 

In order to discuss the orthogonality of h k we first show the following lemma. 

Lemma 4.1. Let / be a density satisfying (4.1) and for fixed k,m E {0, 1, . . .}, k ^ m, 
consider the polynomials h k and h m , given by (4.2). Assume that 

(a) The density / process a suitable number of moments so that f® \h k (t)h m (t) \f(t)dt < °°; 

(b) ai + (k + m+ \)b 2 ^0; 

(c) hm{p 2 (x)f(x)[ti k (x)h m (x)-h k (x)ti m (x)]} = lim{p2(x)f(x)[h' k (x)h m (x)-h k (x)h' m (x)}}. 

x/-(£) x\a 

Then, 




h k (x)h m (x)f(x)dx = 0. 



[We shall show that, under (a) and (b), both limits in (c) exist (in R), but it is not guaran- 
teed that they are equal; in fact, their difference equals to (k — m){a\ + (k + m+ 1)^2) x 
J£h k (t)h m (t)f(t)dt.] 



Proof. Multiply both hands of (4.4) by h m , interchange the roles of k and m and subtract 
the resulting equations to get 



Xh k (t)h m (t)f(t) = h m (t) [f(t)p2(t)h' k (t)}' - h k {t) [f(t)p 2 (t)h' m (t)l a < t < co, (4.5) 



where A = (k — m)(a\ + (k + m+ 1)^2) 7^ 0, by (b). Now, it is easy to verify the Lagrange 
identity: 

{ [f(t)p 2 (t)h> k (t)] h m (t) - [f(t) P 2(t)h' m (t)]h k (t)}' 

=h m m{t)P2m{t)}'-hm{t)P2mr{t)}'- 

Thus, integrating (4.5) over [x,y] C (a,Co), and in view of (4.6), we conclude that 

y h k (t)h m (t)f(t)dt = lp2(y)f{y)[h' k (y)h m (y)-h k (y)h' m (y)} 

- jP2(x)f(x)[h' k (x)h m (x)-h k (x)h' m (x)]. 
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Therefore, taking limits as x \ a and y /* (O and using (a) and (c) we get the result. 
Working as in the proof of Proposition 3.3 it is easily seen that both limits in (c) exist in 
R, whenever (a) and (b) hold. In fact, it is true that under (a), 



[CO 

(k-m)(ai + (k + m+l)b 2 ) / h k (t)h m (t)f(t)dt 

J a 



= hm{p 2 (y)f(y)[h' k (y)h m (y)-h k (y)h' m (y)]} (4.7) 

y/-(0 

- ^{P2(x)f(x)[h' k (x)h m (x)-h k (x)h' m (x)]}. □ 

The following result is an immediate consequence of Lemma 4. 1. 

Theorem 4.2. Let / be a density in (a, co) which satisfies (4.1). For some (fixed) n £ 
{1,2, . . .} consider the set M'n := {/zo,/zi, • • • ,h n }, formed by the first n + 1 polynomials in 
(4.2). Then the set Ji^ l is an orthogonal system (containing only non-zero elements) with 
respect to / if and only if the following conditions are satisfied: 

(i) The density / process In — I finite moments; 

(ii) rT)= 2 («i +./W0; 

(iii) lim x ^ co x J p 2 (x)f(x) =]xm x \ ja x i p2{x)f{x) for each j £ {0, 1, . . . ,2n - 2}. 

Proof. Let X ~ / and assume first that (i)-(iii) are satisfied. Condition (ii) shows, in 
view of (4.3), that deg(%) — k for all k £ {0,1,..., n}. Fix k,m £ {0,1,..., n} with 
m^k. Since E|X| 2 ' 1-1 < oo by (i), it follows that E\hk(X)h m (X)\ < °o, i.e. the integral 
J® hk(x)h m (x)f(x)dx is (well-defined and) finite. Finally, since hih m — h]fi' m is a polyno- 
mial of degree k + m— 1 (observe that lead (h! k h m — hkh'„^) = (k— m)lead(/^)lead(/z m ) ^0), 
(iii) ensures that assumption (c) of Lemma 4.1 is also fulfilled and, hence, 



hk(x)h m (x)f(x)dx = 0. 

Conversely, assume that the set J% = {ho, hi, . . . ,h n } is orthogonal with respect to 
/; that is, E\h k (X)h m (X)\ = f£\h k (x)h m (x)\f(x)dx < °° for all k,m £ {0,1,..., n} with 
m ^ k, and f® h k (x)h m (x)f(x)dx = 0. It follows that, necessarily, deg(^) = k for all 
k = 1 , 2, . . . , n; for if k is the smallest integer in { 1 , 2, . . . , n} for which lead (h k ) = then 
we can write h k (x) = T!}=o c j n j( x ) f° r some constants cj, and this implies that 



h 2 k (x)f(x) 
Subsequently, the inequality 



k-l 

£ Cjhj{x)h k (x) 



\hj(x)h k (x)\f(x). 

j=o 



k-l 

E 





[■CO 


\ C j\ 


la ' 
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shows that % e L 2 (a, co) and, finally, 

/■CO k—1 rtO 

/ hl(x)f(x)dx=Y,Cj h k (x)hj(x)f(x)6x = 0, 

J a y_Q .7 a 

by the orthogonality assumption. Since is continuous (a polynomial) and / is positive 
in a subinterval of (a, co) with positive length, it follows that % = 0, which contradicts 
the assumption that contains only non-zero elements. Therefore, n£=ol eac K%) 0, 
and (4.3) yields (ii). Obviously, E\h n (X)h n -\(X)\ < oo is equivalent to E|X| 2 " -1 < oo and 
(i) follows. Since g^ m = h' k h m — hkh' m is a polynomial of degree exactly k + m — 1 (for 
k 7^ m), we can form a linearly independent set 

{go,£i,-- £2/7-2} Q {gun ■ k.m = 0. l,...,n, k^m}, 

with deg(g ; ) = j for each j. Applying (4.7) inductively to g ,g u . . .,g 2n -2 we get (iii). 

□ 

Example 4.1. It may happen that % = for all k ^ 1. For instance consider the den- 
sity f{x) = C/x, 1 < x < 2; this density satisfies (4.1) with (puPi) = (— l,x). Although 
fl hkh m f = for m ^ k, the trivial system = { 1 , 0, . . . , 0} is not considered as orthog- 
onal in this case. Condition (ii) of Theorem 4.2 eliminates such trivial cases. 

Example 4.2. The density f(x) = \x 2 , — 1 <x< 1, satisfies (4.1) in (a,co) = (—1,1). 
The choice (p\,P2) = (2,jc) leads to constant polynomials, (k+2)l/2. A set h m } 
can never be orthogonal; this explains that condition (b) of Lemma 4.1 is necessary. On the 
other hand, the choice (pi, pi) = (2x,x 2 ) yields the polynomials = c^xr with = (2k + 
2)\/(k + 2)\. The limits in Lemma 4.1(c) are \c^c m {k — m) and jCkC m (k — m)( — l) k+m+l ; 
they are equal if and only if k + m is odd, in which case and h m are, obviously, orthog- 
onal. Clearly, any set containing three (or more) polynomials cannot be an orthogonal 
set. 

Remark 4.1. While the density / of Example 4.2 satisfies the (generalized) Pearson 
differential equation (4.1) and has finite moments of any order, the system {/zo,/zi,/?2} 
fails to be orthogonal. The same is true for the Pearson density 

fix) = . = , -oo <a<x<co<°°. 
VI +x 2 

Now {p\,P2) = ( — jc, 1 +x 2 ) and {/?o,/zi,/?2} = {l,x, 3 + 6x 2 } so that /?o/z2 ^ 3 and the 
system {/zo, /zi , ^2} cannot be orthogonal (with respect to any measure). Does this happen 
because these / lie outside the Integrated Pearson family? In other words, it is natural to 
state the following question: 

If a density / has finite moments up to order 2n—\ (for some fixed n ^ 2) and 
satisfies (4.1), and if the system {/zo,/?i, . . . ,h n } of the first n+ 1 Rodrigues 
polynomials is orthogonal with respect to /, does it follow that this / belongs 
to the Integrated Pearson family? 
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The answer is in the affirmative. In particular, the following result holds. 

Theorem 4.3. Assume that a differentiable density / with S(f) = {x : f(x) > 0} C 
(a, Gt)) has finite third moment and satisfies (4.1). Let ho = l,hi,fi2 be the first three 
Rodrigues polynomials given by (4.2), consider the system Jify = {^o, h\ , h{\ and assume 
that is non-trivial, i.e., h\^0 and h 2 ^ 0. If the system J#2 is orthogonal with respect 
to / then there exists a subinterval (a ; , (O 1 ) C (a, co), a quadratic polynomial 

#(x) = <5x 2 + /3x + y, with {x: q{x) > 0} = (a',G)'), 

and a number /i G (a', ft)') such that / ~ IP(/i;g) = IP(/i; 5, /3, 7). Moreover, there exists 
a constant 0^0 such that q(x) = 6p2(x), x G R. 

Proof. In view of Theorem 4.2 and the fact that / has finite third moment, the orthogonal- 
ity assumption is equivalent to 



and 
where 



(a l +2b 2 )(a l +3b 2 )(a l +4b 2 ) ^0 (4.8) 

L J {a)=L j (a>), j = 0,1,2, (4.9) 
Lj(a) := limx^M/M, Lj(co) := lim x j p 2 (x)f(x). (4.10) 



To simplify cases we can apply an affine transformation x Ax + c (A ^ 0, c G R) to /. 
By considering /(x) = wjfi^j^) in place of / it is easily seen that (4.1) is satisfied in the 
translated interval (a, a) for p\{x) = Api(^) and P2(x) = X 2 p2{^j £ )\ since a\ = a\ 

and b\ = b2, (4.8) remains unchanged. Obviously / has finite third moment if and only 
if / does. Moreover, it is easily seen from (4.2) that the translated polynomials are 
related to by %(x) = X k h^{^j^); thus, lead(%) =lead(%) and, in particular, the system 
^2 is non-trivial if and only if the same is true for the system M2 '■= {ho,h\,h 2 }. The 
orthogonality of the system Jfy with respect to / is equivalent to the orthogonality of the 
system M2 with respect to /; indeed, 

h k (x)h m (x)f(x)dx = X k+m / h k (x)h m (x)f(x)dx. 

J a 

It remains to verify that (4.9) are equivalent to Lj(a) = L/(ft>) (j = 0, 1 , 2), where Lj(a) := 

]im x \ftx-ip2(x)f(x), Lj(co) := lim^gx^ (*)/(■*)■ To this end, it suffices to observe the 
relations 



£(g W( »)={ 



k -L;(a), if A <0. 
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Thus, it is easily seen that Lj(oc) = Lj(co) (j = 0, 1,2) if and only if Lj(a) = Lj(co) 
(J = 0,1,2). 

It is clear from the above considerations, and in view of Proposition 2.1 (vi), that we can 
freely apply any affine transformation, either to the polynomial p2 or to the density / and 
its support (a, co); under such transformations, the conclusions as well as the assumptions 
of our theorem remain unchanged. 

The rest of the proof is easy but tedious since we just have to examine all possible 
non-equivalent cases by solving the differential equation (4.1) in each case. We shall try 
to give a somewhat complete approach as follows: 

Assume first that deg(p2) = 2 and that its discriminant, A, is strictly negative. Ap- 
plying an affine transformation and dividing both p\ and p2 by lead(/?2) 7^ we may 
assume that p2 = x 2 + bo for some bo > 0. If pi = then, necessarily, (a,co) is finite 
and / ~ U (a, co); but in this case, h 2 (x) = 6x 2 + 4-bo ^ 4bo > cannot be orthogonal to 
ho = 1. If deg(pi) = 0, that is, p\ = ao 7^ 0, then the density 

is bounded away from zero, so that (or, co) must be again finite. Then, the assumed or- 
thogonality of J^2 fails because (4.9) shows that a = CO. Finally, assume that deg(pi) = 1 
i.e. p\ = ao + a\x with a\ 7^ 0. In this case, a\ £ {—2, —3, —4} because of (4.8). Since 

/(*) =C(x 2 + Z, )^exp (-^=)) , 

it follows that either (a, co) is finite or, otherwise, a\ < — 4 (for the third moment to exists). 
If (a,co) is finite, the assumed orthogonality fails because (4.9) shows that a = CO. If 
a > —00, (0 = 0° then the assumed orthogonality fails again from (4.9) since Lq(oc) > 0, 
^o(°°) = 0- The case a = — °°, co < °<> is similar to the previous one (we can also make 
the transformation x —x). Therefore, the unique case where Mj, is indeed orthogonal is 
when (a, co) = E. Then, 

E/ii(X) = (ai+2b 2 )n + (a + 2bi) =0 implies that /x = — — — 

—2 — a\ 

(note that Z? 2 = \,b\ = 0) and, hence, p\ + p' 2 = (— 2 — fli)(/i —x). In view of Proposition 
3.3 we see that 

f ~ IPG"! 9) Wlth M = —z and q(x) = — = w . 

— 2 — a\ —2 — a\ —2 — a\ 

Next, assume that deg(/?2) = 2 and A = 0. Applying an affine transformation and 
dividing both p\ and p2 by lead(/?2) 7^ we may further assume that p2 = x 2 . If p\ = 
then, necessarily, (a,co) is finite and / ~ U(a,co); but in this case, h 2 (x) = \2x 2 ^ 
cannot be orthogonal to ho = 1. Let deg(j!7i) = 0, that is, p\ = ao 7^ 0. With the map 
x 1 — ^ —x, if necessary, we may further translate the density to have either the form 

f(x) =Ce * , < a < x < co < «>, 
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or the form 

C\e *, 0<x<ft)<oo, 
0, -oo <; a < x < 0, 

where, necessarily, ao > in the second case. In both cases the assumed orthogonality 
fails because of (4.9). Finally, assume that deg(pi) = 1 i.e. p\ = ao + a\x with a\ ^ 0. 
In this case, a\ E" {—2, —3, —4} because of (4.8). With the map x (-)■ —x, if necessary, we 
may further translate the density to have either the form 

f(x)=Cx ai e * , 0^a<x<a>^°°, 



or the form 

/to 



Cix fll e 0< x < ft) ^ oo 
0, -oo < a < x < 0, 



where, necessarily, ao > in the second case. If ft) < oo then, due to (4.9), the assumed 
orthogonality fails for both cases. If ft) = oo and a > then we must take ai < —4 for the 
finiteness of the third moment (note that in this case, ao E R can be arbitrary since a > 0), 
but the orthogonality fails because of (4.9), since L${a) > 0, Lo(°°) = 0. In the last case 
where a ^ and ft) = oo (thus, ao > and a\ < —4) the orthogonality is indeed satisfied. 
This is so because it is easy to verify both (4.9) and (4.8). On the other hand, since we 
have assumed that ~Eh\{X) = (a\ + 2b2) jU + (ao + b\ ) = 0, it follows that /i = _^°_ 2 (note 
that Z>2 = 1, b\ = 0) and Pi+ p' 2 = (—2 — ai)Qu — x). In view of Proposition 3.3, this 
density belongs to the Integrated Pearson system with 

a p 2 (x) x 2 

11 = and q(x) 



-a 2 — 2 —2 — ai —2 — a\ 

Moreover, observe that its support, («', ft)) = (0,°°) C (a, ft)), is different than (a, ft)), 
whenever a < 0. 

Next, assume that deg(/?2) = 2 and A > 0. Applying an affine transformation and 
dividing both p\ and p 2 by lead (p 2 ) ^0 we may further assume that p 2 = x(l — x). 
Solving the differential equation (4.1) for arbitrary p\ and for all x E R\ {0, 1} we see 
that the general solution has the form 

r Ci(-jc) a (1-jc) 6 , if x<0, 
f(x) = l C 2 x A (l-x) B , if < jc < 1 , 
[ C^lx-l) 8 , if x> 1, 

where A and B are arbitrary parameters and C\,C2,C^ ^0 are arbitrary constants, not all 
zero. The restrictions on A and B depend on the interval (a, ft)) that we consider and 
the positivity or vanishing of each branch; they have to be chosen in such a way that the 
resulting function is differentiable and integrable in (a, ft)). For example, if [0, 1] C (a, ft)) 
and Ci,C2,C3 > then, in order that / is (continuous and) differentiable at the points 
and 1, we must take A > 1 and B > 1; but then it is necessary for (a, ft)) to be bounded, 
since, otherwise, the resulting / could not be integrable. The several possibilities can be 
classified according to the number of roots of p 2 that fall into (a, ft)), as follows: 
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(1) Let {0,l}n(a,fi>) = 0. Then, either (a,a>) C (0,1) or (a,ct)) C (-°o,0) or 
(oc,G)) C ( 1 , oo) . For the first case we observe that (4.9) fails whenever a > or co < 1 ; if 
(a, ft)) = (0, 1) then A> — 1, B > — 1, pi = A — (A + £)* and the orthogonality assump- 
tion yields Efci(X) = (ai+2b 2 )n + (ao + bi) = — (A + B + 2)ji + (A + 1) =0. Hence, 
Pl + p ' 2 =A + 1 - (A+B + 2)x = (A+B + 2)(n-x) and p 2 (x)f(x) =Cx A+1 (l -x) B+l ^0 
as x /• 1; thus, Proposition 3.3 shows that / ~ IP(/i;g) with 

A + 1 m h „^ - ^(*) 



* A+B + 2 HK ' A+fl + 2 



Using the map x H- —x for the second case and the map iH-x-l for the third case it 
is seen that both cases are reduced to (a, co) C (0,°°) and translate p 2 to p 2 = —x(x + 1); 
equivalently, we can take p 2 = x(x+ 1). Moreover, the general solution in this case takes 
the form 

f(x) = Cx e (x+l) x , (Ka<x<w^°o. 

If ft) < oo or a > it is easily seen that (4.9) fails. In the remaining case where (a, co) = 
(0, oo) we must have > — 1 (for integrability close to zero) and + X < —4 (for finiteness 
of the third moment). Since p\ = ao + a\x = 6 + (6 + X)x, p 2 = bo + b\x + b 2 x 2 =x + x 2 
and h\ = (a\ + 2b 2 )x+ (ao + bi) = (6 + X +2)x+ (0 + 1), the assumed orthogonality 
yields Ehi(X) = (0 + A+ 2)^ + + 1 =0; thus, p x + p' 2 = (0 + X + 2)x+(6 + 1) = 
— (9 + X +2)(jU — x) and Proposition 3.3 shows that 

/ ~ JP(ji;q) with ju = + 1 and q(x) - P2 ^ 



-(9+X+2) Iy ' -(G + X + 2) 

(2) Let {0,l}n(a,a)) = {1} or {0, 1} n (a, ft)) = {0}, that is, ^ a < 1< co^ooor 
— oo ^a<0<ft)^l. Clearly the map x i— > 1 — x translates the second case to the first one 
and leaves p 2 unchanged; thus, it suffices to consider only the first case. If < a < 1 < 
ft) < oo it is easily seen that (4.9) fails for all choices of (C 2 ,C 3 ) G {(+,+), (+,0), (0, +)}, 
where (C 2 ,C 3 ) = (+,0) means C 2 > 0, C 3 = 0, etc. If a = and 1 < co < oo then (4.9) 
fails for all choices of (C 2 ,C?) G {(+, +), (0, +)}, while it is satisfied when C 2 > and 
C 3 = 0. Similarly, if < a < 1 and ft) = oo then (4.9) fails for all choices of (C 2 ,C 3 ) G 
{(+, +), (+,0)}, while it is satisfied when C 2 = and C 3 > 0. Finally, if a = and ft) = oo 
then (4.9) is satisfied for all choices of (C 2 ,C 3 ) G {(0, +), (+,0)}, while C 2 > 0, C 3 > is 
not a permissible choice because / is not integrable. Therefore, the two distinct situations 
where orthogonality can be verified are given by 

. , f C 2 x A (l-x) B , 0<x<l, j _z* / \ f 0, a<xsCl, 

*« = u i <*<«;, and Mx) ={c>(*-i)» i<*<~; 

where C 2 > 0, A > -1, B > 1 and 1 < CO < «» for fi; C 3 > 0, 5 > 1, A + 5 < -4 and 
^ a < 1 for f 2 . Now it is easily seen that both f\ and f 2 belong to the Integrated 
Pearson family. Specifically, Proposition 3.3 shows that f\ ~ IP(/i;g) with 

u = and q(x) = — — = — — , (4.11) 

p A+fl + 2 qK 1 A + fl + 2 A+fl + 2' v ' 
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while f2 ~ IP(/i;g) with 

u = = 1 H and q(x) = = . 4.12) 

p A + fl + 2 -A-fl-2 yv ; A+fl + 2 -A-fl-2 v J 

(3) Let {0, 1} C (a, co), that is, -00 ^ a < < 1 < co ^ 00. We have to study the 
following cases: (3a): a = — °°, co = °°; (3b): — °° < a < 0, co = °°; (3b ; ): a = — °°, 
1 < co < °°; (3c): —00 < a < 0, 1 < co < °°. Clearly the map x h-» 1 — x translates the case 
(3t/) to (3b) and leaves p 2 unchanged; thus, it suffices to consider only the cases (3a), (3b) 
and (3c). 

Assume first (3a). If (C h C 2 ,C 3 ) G {(+,+,+), (+,0, +), (0, +, +)} (where, e.g., 
(Ci,C 2 ,C 3 ) = (+,0,+) means C\ > 0, C 2 = 0, C 3 > etc.) it follows that A > 1 and£> 1 
and, thus, / fails to be integrable (at a neighborhood of +°°). The case (Ci,C 2 ,C 3 ) = 
(+,+,0) is equivalent to (Ci,C 2 ,C 3 ) = (0,+,+) (by the map n-> 1 — x) and, again, / 
fails to be integrable. By the same map, the cases (+,0,0) and (0,0,+) are also equiva- 
lent. Assuming, e.g., (Ci,C 2 ,C 3 ) = (0,0, +) it is easily seen that B > l,A + 5< —4 are 
necessary and sufficient for / being integrable, differentiable at and 1 and with finite 
third moment. In this case both (4.9) and (4.8) are satisfied so that the system {/?o,/zi,/i2} 
is indeed orthogonal. Finally, if we assume that (Ci,C2,C 3 ) = (0, +,0) then, necessarily, 
A > 1, B > 1 (for differentiability of / at and 1) and it follows that the system {/?o, h\, I12} 
is indeed orthogonal, since both (4.9) and (4.8) are satisfied. 

Next, assume (3b). If (Ci,C 2 ,C 3 ) e {(+,+, +),(+, 0,+),(0,+,+)} it follows that 
A > 1 and B > 1 and, thus, / fails to be integrable. If (Ci,C 2 ,C 3 ) = (+, +,0) then A > 1, 
B > 1 and (4.9) fails. Also, if (C h C 2 ,C 3 ) = (+,0,0) then B > 1 and (4.9) again fails. 
Assuming (Ci,C 2 ,C 3 ) = (0,0,+) it is easily seen that B > l,A+5< —4 are necessary 
and sufficient for / being integrable, differentiable at and 1 and with finite third moment. 
In this case both (4.9) and (4.8) are satisfied so that the system {/zo,/?i,/z 2 } is indeed 
orthogonal. Finally, if we assume that (Ci,C 2 ,C 3 ) = (0, +,0) then, necessarily, A > 1, 
B > 1 (for differentiability of / at and 1) and it follows that the system {/?o,/ii,/i 2 } is 
indeed orthogonal, since both (4.9) and (4.8) are satisfied. 

Finally, assume (3c). If (Ci , C 2 , C 3 ) e { (+, +, +),(+, 0, +) , (0, +, +),(+, +, 0) } it fol- 
lows that A > 1 and B > 1 and (4.9) fails. By the map x H- 1 — x it is easily seen that the 
cases (+,0,0) and (0,0,+) are equivalent. Assuming, e.g., (Ci,C 2 ,C 3 ) = (0,0,+) it is 
easily seen that B > 1 is necessary and sufficient for / being integrable, differentiable at 
and 1 and with finite third moment; but then, (4.9) fails. Finally, if we assume that 
(Ci,C 2 ,C 3 ) = (0, +,0) then, necessarily, A > 1, B > 1 (for differentiability of / at and 
1) and it follows that the system {/zo,/zi,/z 2 } is indeed orthogonal, since both (4.9) and 
(4.8) are satisfied. 

Therefore, the two distinct situations where orthogonality can be verified are given by 



{0, a < x < 0, , 

C 2 A1 <x < 1, and AM = { ^ {x _ 1)B 

where C 2 > 0, A > 1,5 > 1 and-°° ^ a < 0, 1 < (Q < °°for/i; C 3 > 0,5 > 1, A+B < -4 
and —00 ^ a < for / 2 . Now it is easily seen that both f\ and f 2 belong to the Integrated 
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Pearson family. Specifically, Proposition 3.3 shows that f\ ~ IP(/i;g) with /! and g as in 
(4.11), while f 2 ~ IP(jU;<?) with /i and g as in (4.12). 

Next, assume that deg(/?2) — 1 an d, without loss of generality (by using an affine map) 
we shall further assume that p2 = x. If p\ = ao + a\x, the general solution of (4. 1) is 

f ( \ -S Cix^e^ if x < 0, 
W ~ 1 C 2 x clQ e aiX if x>0, 

where ao and a\ are arbitrary parameters and C\,C2 ^ are arbitrary constants, not both 
zero. The restrictions on ao and a\ depend on the interval (a, ft)) that we consider and 
the positivity or vanishing of each branch; they have to be chosen in such a way that the 
resulting function is differentiable and integrable in (a, ft)). Assuming that < a < ft)< °° 
we readily see that any values of ao, a\ G R are admissible but (4.9) fails. If0<OC<ft) = °° 
then either a\ = and ao < —3 (for finiteness of the third moment) or a\ < and ao £ R- 
In the first case both (4.9) and (4.8) are violated: the limits are unequal although 

f'OO 

I hj c (x)h m (x)f(x)dx = for k^m, k,m£ {0,1,2}, 
J a 

because h\ = hi = 0. In the second case, (4.9) fails. Ifoc = 0<ft)<°° then ao > — 1 
and a\ G R; it follows that (4.9) fails. Finally, if a = and ft) = °° then ao > — 1 and 
a\ < 0. In this case both (4.9) and (4.8) are satisfied and the system {ho, hi, hi} is, indeed, 
orthogonal. Also we see that ~Eh\(X) = a\fi + ao + 1 = so that p\ + p' 2 = aix + ao + 1 = 
— ai(/i — x). Now, from Proposition 3.3 it follows that 

/ ~ TP(jl',q) with n = and q(x) = — = (4.13) 

— a\ — a\ — a\ 

By the map x >-)■ — x we can transform the cases — »° ^ a < ft) ^ to the previous ones, 
since p2 = x is transformed to 772 = ~ x. It remains to investigate the cases — °° ^ a < 
< ft) ^ 00; then, necessarily, ao > 1. Assuming that — °° <a<0<ft)<°°itis easily 
seen that (4.9) fails for all choices of (Ci,C 2 ) G {(+,+), (+,0), (0, +)}. Assuming that 
a = -co, ft) = 00 we see that for / to be integrable it is necessary and sufficient that a\ < 
if C2 > and a\ > if C\ > 0; therefore, if (C\ , C2) = (+, +) then / is not integrable. The 
case (Ci,C2) = (+,0) is transformed (by * !-)■ — x) to (Q,C2) = (0, +). In the last case we 
can see that ao > 1 and a\ < are necessary and sufficient for / to be differentiable (in 
(a, ft)) = R) and to have finite third moment. As before we can easily check that both (4.9) 
and (4.8) are satisfied, that {/zo, ^1 , ^2} is orthogonal and that / ~ IP(/i;a) with \x and q 
as in (4.13). The map x 1— > — x shows that the last two cases, a = — °°, < ft) < °°, and 
—00 < a < 0, ft) = 00, are equivalent. By considering the second one we see that ao > 1 
and a\ < are necessary and sufficient for / to be differentiable (in (ce,°°)) and to have 
finite third moment. However, if {C\,Ci) G {(+, +), (+,0)} it is easily seen that (4.9) is 
violated because the limits as x \ a are nonzero. In the remaining case (CijCz) = (0, +) 
we can easily check, as before, that both (4.9) and (4.8) are satisfied, that {ho, hi, hi} is 
orthogonal and that / ~ IP(/i;a) with /! and q as in (4.13). 
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Finally, assume that deg(/?2) = or, equivalently, p 2 = 1. Then, if p\ = ao + a\x, it 
follows that 

f(x) =Cexp(aox + aix 2 /2), a<x<co. 

If the support (a, ft)) is bounded then it is easily seen that (4.9) fails. The cases — °° < 
a < ft) = °° and — °° = a < ft) < °° are, obviously, equivalent (by the map x \-t —x, which 
leaves p 2 unchanged). Assuming that — °° < a < ft) = °° we see that either a\ = 0, ciq < 
or ai < 0, ao G R; in the first case both (4.9) and (4.8) fail, while (4.9) fails in the second 
one. Finally, in the last remaining case where (a, ft)) = R, we see that, necessarily, a\ < 0. 
Then, for any value of aq £ R we check that both (4.9) and (4.8) are satisfied so that 
{/?o,/*i,/*2j is, indeed, orthogonal. Observe that, by assumption, ~Eh\(X) = a\\i +ao = 0; 
thus, pi+p' 2 = ao + a\x = — ai(/i — x). Proposition 3.3 shows that / ~ lP(/j,;q) with 

]X = ^- and ^) = ^W = J_. infacti /^iv(flo/(-ai),(l/v / =^") 2 ). 
— ai — «i —a\ 

This subsumes all possible cases and completes the proof. □ 



5 Orthogonality of the Rodrigues-type polynomials and of their deriva- 
tives within the Integrated Pearson family 

Assume that / is the density of a random variable X ~ EP(/i;#) = EP(/x; 5, j8 , y) with 
support (a, ft)). From Theorem 4.1 it follows that the function 

P k (x):=t^^[q k (x)f(x)}, a<x<co, * = 0,1,2,... (5.1) 
is a polynomial with 

deg(flt)<* and lead(P^) = f[ (1 := Cjk (5), fc = 0,l,2,... . (5.2) 

Obviously co(5) := 1, i.e. an empty product should be treated as one. 

The polynomials i\ are special cases of the polynomials defined by (4.2); in fact, 
F% = ( — l) k hk. They are particularly important because under natural moment conditions 
they are, indeed, orthogonal with respect to the density /; see, e.g., [9] (pp. 295-296), [14], 
[21], [3]. The orthogonality follows immediately from Theorems 4.2 and 4.3. Moreover, 
the polynomials Pk and their derivatives satisfy a number of useful properties that will be 
reviewed here. The first three are 

P (*) = l, 

P l (x)=x-n, (5.3) 
P 2 (x) = (1 - 5)(1 - 28)x 2 -2(1 - S)(n +/3)x + i u 2 + /3m - (1 -2S)y. 

An alternative simple proof of the orthogonality of the polynomials defined by (5.1) 
can be derived by means of the following covariance identity, which extends Stein's iden- 
tity for the Normal distribution and has independent interest in itself. 
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Theorem 5.1 ([3], pp. 515-516). Let X ~ IP(/i;5,j8,y) = IP(^;<?) with density / and 
support (a, ft)). Assume that X has 2£ finite moments for some fixed k G {1,2,...}. Let 
g : (a, ft)) R be any function such that g G C A_1 (a, ft)), and assume that the function 

dx k ~ 

is absolutely continuous in (a, ft)) with a.s. derivative g( k \ If E^*(X)|gW(X)| < °° then 
E|i\(X)g(X) | < °° and the following covariance identity holds: 

EP*(X)g(X) = E/(X)g«(X). (5.4) 

It should be noted that when we claim that h : (a, ft)) — > R is an absolutely continuous 
function with a.s. derivative h' we mean that there exists a Borel measurable function 
h! : (a, ft)) — >• R such that h! is integrable in every finite subinterval [x,y] of (a, ft)) such 
that 

J h'(t)dt = h(y) - h(x) for all [x,y] C (a, <o). 

Corollary 5.1 ([3], p. 516). LetX ~ IP(jii;<5,/3,y) = IP(/i;<?). Assume that for some 
« G {1,2, . . .}, E|X| 2 ' 7 < oo or, equivalently, 8 < j^j- Then 

2k— 2 

E[P k (X)P m (X)} = 8 k7in k\Eq k (X) J] (1 -jS) = 8 k , n k\c k (8)Eq k (X), 

j=k-\ w-5) 

fc,m G {0, 1, . . . ,«}, 

where 8 kj , n is Kronecker's delta and where an empty product should be treated as one. 

It should be noted that the orthogonality of P k and P m , k ^ m, k, m G {0, 1 , . . . , n}, re- 
mains valid even if 8 G [^Tp 2^2)' m tn ^ s case ' however, P n G"L 2 (R,X) since lead(.P„) > 
and E|X| 2 " = 00. On the other hand, in view of Corollary 2.2, the assumption E|X| 2 ' 1 < 00 
is equivalent to the condition 8 < t^tt- Therefore, for each k G {0, 1, . . . ,n} and for 
all j G {k-l,...,2k-2}, 1-jS > because {k- l,...,2fc-2} C {0, 1, . . . ,2n - 2}. 
Thus, c k (8) > 0. Since F[q(X) > 0] = 1, deg(g) ^ 2 and E|X| 2 ' 1 < 00 we conclude that 
< E^(X) < 00 for all k G {0, 1, ... ,n}. It follows that the set {fo, 0i , . . . , <j> n } C L 2 (R,X), 
where 

„ n ftW 5ff &«*(*)/(*)] 

<m*) : =- — ~ 1/2 = ~ — -772", fc = 0,l,...,n, (5.6) 



(*M5)E**(X)) 1/Z (fe!E^(X)n^ t (l - ;5) 



is an orthonormal basis of all polynomials with degree at most n. Moreover, (5.2) shows 
that the leading coefficient is given by 



lead(0jt) :=d k {n;q) 



ng 1 (i-7g ) N 1/2 

fc!E#*(X) 
c k (8) N 1 / 2 



(5.7) 



^!E^(X) 



>0, k = 0,l,...,n. 
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Let X be any random variable with E|X| 2n < °o and assume that the support of X is not 
concentrated on a finite subset of R. It is well known that we can always construct an 
orthonormal set of real polynomials up to order n. This construction is based on the first 
In moments of X and is a by-product of the Gram-Schmidt orthonormalization process, 
applied to the linearly independent system {1, x,x 2 , . . . ,x"} C L 2 (R,X). The orthonormal 
polynomials are then uniquely defined, apart from the fact that we can multiply each 
polynomial by ±1. It follows that the standardized Rodrigues polynomials 0& of (5.6) are 
the unique orthonormal polynomials that can be defined for a density / ~ IP(/i; 5, /3 , 7), 
provided that lead(^) > 0. Therefore, it is useful to express the L 2 -norm of each /\ in 
terms of the parameters <5,/3, 7 and ji and, in view of (5.5) and (5.6), it remains to obtain 
an expression for Eq k {X). To this end, we first recall a definition from [20]; cf. [10]. 

Definition 5.1. Let X ~ / and assume that X has support J(X) = (a,co) and belongs to 
the integrated Pearson family, that is, / ~ IP(/i;g) = IP(/i;5,/3,7). Furthermore, assume 
that EX 2 < 00 (i.e. 8 < 1). Then we define X* to be the random variable with density /* 
given by 

™ : =W' a<x<m - <5 ' 8) 

Since Pi = x — /!, setting k = 1 in the covariance identity (5.4) we get (see [7], [20]) 

E[(X-il)g(X)] = Cov[X,g(X)] = E[q(X)g'(X)}. (5.9) 

This identity is valid for all absolutely continuous functions g : (a,<y) — > R with a.s. 
derivative g' such that Eq(X)\g'(X)\ < °o. Thus, applying (5.9) to the identity function 
g(x) = x it is easily seen that Eq(X) = VarX = a 2 , so that (cf. [10]) 

X* ~/*(jc) = \q(x)f(x), a<x<co. 

The following lemma shows that X* is integrated Pearson whenever X is integrated Pear- 
son and has finite third moment. 

Lemma 5.1. If X ~ IP(jU;<5,/3,7) =IP(jU;g) with support J (X) = (a, a) andE|X| 3 < °o 
thenX* ~ rP(jU*;g*) with the same support J(X*) = J(X) = (a, co), 

M* = y^, and q *( x ) = J^, a<x<co. (5.10) 

Proof. From Corollary 2.2 it follows that the assumption E|X| 3 < o° is equivalent to 8 < j. 
Let X* ~ f*(x) = q(x)f(x) fEq(X) = q(x)f(x) /o 2 , a < x < CO, where o 2 is the variance 
of X. Then, it follows that 

E[Xq(X)] 



/I* = EX" 



o 2 



Define P\{x) = x- \i and Pi{x) = (x — \l) 2 — (x— n)cf(x) - (1 - 28)q{x). We have 
EP 1 (X) = and EP 2 (X) = o 2 - Cov [X,q'{X)) - (1 - 28)Eq(X) . Applying the covariance 
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identity (5.9) tog(x) = x andto g(x) = q\x) we see that EP 2 (X) = 28o 2 -E[q(X)q"(X)} = 
28o 2 -28Eq(X)=0. Also, 

E[P 1 (X)P 2 (X)} = E(X-n) 3 -E{(X-ll) 2 q'(X)]-(l-28)E{(X-ll)q(X)} 

= Cov [X, (X - n) 2 } - Cov [X, (X - n)q'(X)} -(1-28) Cov [X,q(X)] 

and, once again, (5.9) shows that E[Pi(X)P2(X)] = 0. Now observe that 

X = ( 2(l-g) 1 (l-25) P2(x) + f^8 Pl{x) ) = Say ' 

so that 

EXq(X) = Eq(X)g\X) = Cov[X,g(X)\ = E(X-fl)g(X) = m(X)g{X) 
1 ™,^„,^ , M + /3 w , 



EP 1 (X) J P 2 (X) + ^^E/' 1 2 (X) 



2(l-5)(l-25) iV ' LX ' 1-25 

It follows thatjU* = E[Xq[X)]/o 2 = (/i + /3)/(l -25). 

It remains to show that <?*(x) = g(x) /(l — 25) is the quadratic polynomial of X*, i.e. 

that 

J\n*-t)f(t)dt = q(x)f(x), xeR. 

Equivalently, it suffices to verify the identity 

r {n+P-(l-28)t}q(t)f(t)<it = q 2 (x)f(x), xeR. (5.11) 

J — oo 

Since f(x) = forx ^ (a, to) it follows that the l.h.s. of (5.11) equals to zero for x < a (if 
a > -oo). Also, if ft) < oo and* ^ ft) then the l.h.s. of (5.11) is equal to (fx + P)Eq(X) - 
(1 - 28)EXq[X) = (n +/3)cr 2 - (1 - 25)^a 2 = 0. Thus, (5.1 1) takes the form = 
whenever x £ (a, ft)). For x e (a, ft)) it is easily seen that 

(q 2 (x)f(x) - jjjL + p - (1 - 25)f }*(*)/(*)*)' 

= (q(x) ■ q(x)f(x)Y - {[1 + 15- (I- 28)x}q(x)f(x) 

= q'(x)q(x)f(x) +q(x)(n-x)f(x) -q{x)f{x){fl + P - (1 -25)*} 

= q(x)f(x) [q\x) + ([i-x)-([l + P) + (\-28)x]=q(x)f(x) [q'(x) -25x-j8] = 0. 

Thus, there exists a constant c G R such that 

[* {H+P-(l-28)t}q(t)f(t)dt = q 2 (x)f(x)+c, a<jc<ft>. (5.12) 

J — CO 

Now observe that lim^ + j8 - (1 - 28)t}q(t)f(t)dt = \im x/ ~ a q 2 {x)f{x) = 0. 

Indeed, the first limit follows from dominated convergence and the fact that Eq(X) = 
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a 2 and E[Xq{X)} = (n + P)o 2 /(\ - 25), while the second one is obvious when CO < °° 
because q(co) = and q(x)f(x) — > E(/i — X) = as x f~ CO. Finally, if CO = °° we have 
q(x)f(x) = o(x~ 2 ) as x — > °° because E|X| 3 < °° and for large enough x, 

f' DO j' CO 

x 2 q{x)f{x)=x 2 (t-n)f(t)dt^ t 2 (t-n)f(t)dt^0, asx^oo, 

J x Jx 

by dominated convergence. This shows that \im X/ ^ co q 2 (x)f(x) = in all cases. Therefore, 
taking limits as x /• co in (5.12) we conclude that c = and (5.1 1) follows. □ 

Theorem 5.2. Let X be a random variable with density / ~ IP(/i;g) = IP(/i;5,j8,y), 
supported in J(X) = (a,co). Furthermore, assume that E|X| 2 " +1 < oo (i.e. 8 < JL) for 
some fixed n G {0, 1, . . .}. Define the random variable X^ with density f k given by 

fl fc(Y) f( x ) 

/fe(x) := Eg*(X) ' a<x<W ' * = 0.1, (5-13) 
Then, ~ EP(/i^;^) with (the same) support 7(Xt) = 7(X) = (a, w), 

Mfe = ' and q k (x) = - ^fcg ' fc = 0,l,...,n. (5.14) 

Moreover, X = X, Xi = X^ = X*, X 2 = Xj* and, in general, X k = X^_ x for k G {1, . . . ,n}. 

Proof. For /: = the assertion is obvious while for /: = 1 (and thus, n ^ 1) the assertion 
follows from Lemma 5.1 since E|X| 3 < °° and, by definition, f\ = /*, /ii = /i* and q\=q* . 
Assume now that the assertion has been proved for some £ G {1 , . . . , n — 1}. Then, 

, E^(X)|X| 3 
1 ^ E^(X) 

because E|X | 2k+3 < oo since k^n — l. Therefore, we can apply Lemma 5.1 to the random 
variable X k ~ IP(^; q k ) = IP(/i fe ; j8 fe , %) obtaining X* ~ IP(/i* ; ^) = IP(jU* ; 8* , jB£, #) 
where 

* _ Vk + Pk _ Bt& + -As _ M + (fc+l)/3 _ 
1-25, 1-2^ l-2(*+l)fi 

and 

On the other hand, since E^r(X yt ) = ^ qk Qq an d X^ ~ /| we get 

fV v _ Qk{x)fk{x) _ T^2kS gg^Q _ Egg) _ (*)/(*) _ f ,x ry . v ^ r , 1 
/feW ~ IWX*) ~ MM ~ ~ Eqk+^X) - Jk+1 W> a<x< ^ 

l-2k8 E,q k {X) 

that is, X^ = X k+ i ~ ~ lP(fi k+ i;q k+ i), and the proof is complete. □ 



36 



G. Afendras, N. Papadatos 



Corollary 5.2. If X ~ IP(jU;<?) and E|X| 2 "+ 2 < oo (equivalently, if 8 < then for 
each k E {0, 1,.. .,n}, 

a, 2 := VarX, = Eq k (X k ) = J^^L- ^ (5.15) 
where qk(x) = 8kX 2 + fikX + Yk and Xk are as in Theorem 5.2. In particular, if 8 < 1 then 



2 :=VzrX = Eq(X) = ^. 

l — o 



(5.16) 



Eft(ia = VarX, = — = = ^+1)8 = 1- (2k + l)8 ' 



Proof. First observe that for any k G {0, 1, . . . ,n}, E\X k \ 2 < °° (and thus, Eq k (X k ) < <*>) 
since 8k = l _ S 2k § < 1 because 8 < j^+y ^ JFTT- ^ ote mat ^ su ffi ces t0 show only (5.16). 
Indeed, since X k ~ lP(^i k ;q k ) it follows from (5.9) (applied to the random variable X k and 
to the function g(x) = x) that a k = NaxX k = Eq k (X k ) . On the other hand, if we manage to 

show that VarX = |^ for any X ~ IP(n;q) with 8 < 1 then, by (5.16) applied to X k , we 
get 

Var Xi =^|>. 
1 - b k 

Since 

[l+kp q(x) . s 8 

1=2*5' qk{x) = T^2J8 and5k = Y^2k8 <h 
(5.16) yields the identity (5.15) as follows: 

QkM THE q{Mk) ^T^lkS) 

s 

l-2k8 

It remains to verify that VarX = o 2 = whenever X ~ JP(ji;q) and 8 < 1. To this end, 
write 

q(X) = q{n) +q'{ll) (X-fi) + S(X - pi) 2 
and take expectations to get o 2 = q(ji) + So 2 , which is equivalent to (5.16). □ 

Corollary 5.3. IfX ~IP(jU;^) andE|X| 2 " < °o for somen ^ 1 (i.e. 8 < 2^y)thenfor 
each k G {1,. .. ,n}, 

A t = A k{mq) := E/(X) = ^ ( '" 2J8> f[ , ( ft* } . (5.17) 

iVoo/ Observe that 

( 1 - 2jS)Eqj(Xj) = Eq(Xj) =^±, j = 0, 1 , . . . ,n - 1, 

where Ao := 1, qo = q,Xo =X. Multiplying these relations for j = 0, 1, . . . ,k — 1 and using 
(5.15) we get (5.17). □ 
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Remark 5.1. (a) It is important to note that the identity (5.4) enables a convenient calcu- 
lation of the Fourier coefficients of any smooth enough function g with Varg(X) < <*> (i.e., 
g E L 2 (R,X)). Indeed, if X ~ IP(/i;5,j3,y) = TP(ji',q) and E|X| 2n < oo then the Fourier 
coefficients c k = E(j>i c (X)g(X) are given by cq = Eg(X) and 

Eq k (X)gW(X) 
(klc k (8)A k (n;q)y/ 2 

where Ck(8) and Ak{pL;q) are given by (5.2) and (5.17), respectively, provided that g is 
smooth enough so that Eq k (X) \gW (X) | < oo for k G { 1 , 2, . . . , n}. 

(b) Obviously, if X ~ IP(jU; 8, j8, y) and <5 ^ (i.e. if X is of Normal, Gamma or Beta-type) 
then E|X|" < °° for all n. Moreover, since there exist an £ > such that Ee tX < °° for \t\ < 
£ it follows that the corresponding polynomials {<l>k}k=Q> given by (5.6), form a complete 
orthonormal system in L 2 (1R;X); see, e.g., [24], [6], [3]. Therefore, for smooth enough 
g with Varg(X) < °° and E# fc (X)|gW(X)| < oo for all k^ 1, the Fourier coefficients are 
given by 

Eo k (X)s^ (X) 

C - E * Ws( ^dW' * = 0,1,2,..., (5.19) 
and the variance of g can be calculated as (see [3], Theorem 5.1, pp. 522-523) 

£EVp«(X) 

,S*MS)a*(m;«) 

Furthermore, the completeness of the Rodrigues polynomials (when X ~ IP(jii;5,j3,y) 
and 5^0) enables one to write ([3], Theorem 5.2, p. 523) 

r , N , N1 " E[q k (X)g { J c) (X)]E[q k (X)g[ k) (X)] 
Cov[ gl (X), g2 (X =1 — ^ , r 2 1 JJ , (5-21) 

provided that for i = 1,2, gi e L 2 (R,X) and Eq k (X) \gf\x) \ < °° for all & ^ 1. The im- 
portant thing in (5.20) and (5.21) is that we do not need explicit forms for the polynomials; 
in view of (5.2) and (5.17), everything is calculated from the four numbers (/i; 5, /3, y) and 
the derivatives of g or gi (i = 1,2). In particular, for the first three types of Table 2.1, (5.20) 
yields the formulae 

oo 2k 

Varg(X) = £ — -E 2 g^(X), ifX -A^a 2 ), (5.22) 

fe=i K - 

oo j— i / \ 

Varg(X) = £ E¥/)(X), ifX ~r(a,A), (5.23) 

~ (a+6 + 2fc-l)r(a)r(6)r(a + 6+fc-l) , t/ ,j. /o. , 

Var * (x) = «r (a ^+*)r(t + *) ^x'(i-x)y«(x), (5 .24) 

ifX~fl(fl,i). 
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Turn now to the orthogonal polynomial system k = 0, 1 , . . . ,n}, of (5.1), obtained 
for a random variable X ~ IP(/i; <5,/3, 7) with support 7(X) = (oc, a)) and E|X| 2 " < 00 
for some n ^ 2, i.e. with 5 < 2^ry- By Lemma 5.1 the random variable X* = X\ ~ 
rP(/i! ; qi ) = TP{ni ; 5i , j8i , 7i) with 

ju + j3 , , s g(x) 
Mi = and ^i(x) - 



1-25 ' 1-25 

and has support (a, to). Since 5 < is equivalent to <5i = jA^ < 5^3 we conclude 
that E|Xi | 2 "~ 2 < 00 and, in particular, VarXi < 00. Therefore, we can define the orthogonal 
polynomial system 

{P k y, * = 0,l,...,/»-l}, 

by applying (5.1) to the density f\ and to the quadratic polynomial q\ of X\, that is (recall 
that/,M = 9 W/M/E 9 (X)) ' 

^ SS 1 ** 1 " 11 = ( 1 -2£)/C) g lf '" lf ""' (5.25) 

a<jc<co, & = 0,l,...,n — 1. 

Clearly the system {P^i; £ = 0, 1, . . . ,« — l}is orthogonal with respect to Xi, but the im- 
portant observation is that we can reobtain it by differentiating the polynomials P k (which 
are orthogonal with respect to X). In fact, the following lemma holds. 

Lemma 5.2. If X ~ lP(n;q) and E|X| 2 " < 00 for some n ^ 1 then the polynomials P k of 
(5.1) and P ky \ of (5.25) are related through 

P k+l (x)=C k (8)P kA (x), k = 0,l,...,n-l, 

where C k (8) := (k+l)(l - k8)(l -28) k . 

Proof. First we show that the polynomials P' k+l are orthogonal with respect to X\ . Indeed, 
deg(i^ +1 ) = k (for k = 0, 1 , . . . , n — 1) and for k, m G {0, 1 , . . . , n — 1 } with k < m we have 

e^ +1 (z 1 )p; +1 (z 1 ) = -2 / p4 +1 wp; +1 (^w/wdx 

to 

Pm+i W^t+i Ok (*)/(*) 



ft) 



1 

V 1 



Now observe that, in view of Lemma 2.1, 

Pm+l (x)Pj e+1 (x)q(x)f(x)\^ = 0, 
because P m +lP k+ 1 is a polynomial of degree m + k+l ^ 2n — 2 and E|X| 2 ' 1 < 00. Moreover, 
K +1 (x) ? (x)/(x)]' = P^ 1 (x) ? (x)/(x)+P^ 1 (x)(/i-x)/(x)=^ +1 0)/(x), 
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where H k+ i(x) = P' k ' +l (x)q(x) + (/i — x)P' k+l (x) is a polynomial in x of degree at most 
k + 1 < m+l. Therefore, 

E^ +1 (Xi)i£ +1 (Xi) = -^EP m+ i(X)H t+1 (Z) = 0, 

since i^+i is orthogonal (with respect to X) to any polynomial of degree lower than m+l. 
Note that the same orthogonality conditions are also valid for {P^iYIZq, that is, 

Eflt,i(Xi)P mj i(Xi) =0 for fc,me{0,l,...,n-l} with fc^m. 

Since deg(Z^ +1 ) = deg(P^i) = fc, & = 0, 1, . . . ,n — 1, the uniqueness of the orthogonal 
polynomial system implies that there exist constants C k ^ such that P' k+l (x) = C k P kt \ (x). 
Equating the leading coefficients we obtain lead(P( +1 ) = Qlead(i\ i), that is (see (5.2)), 

_ lead(i* +1 ) _ (fc+l)lead(A +1 ) _ (k+\)c k+l (8) _ (k + l)U% k (l- jS) 
k lead(P w ) lead(P M ) c^) 11^(1 - A) 



i^i(i-)ii) " n^ + i(i-j5) 



(Jfc + l)(l-JfcS)(l-25)\ □ 



Remark 5.2. We note that the recurrence (5.26) is contained in Beale (1937), eq. (2), p. 
207. Actually, Beale's recurrence (which is stated in a much different notation) is valid 
for the polynomials h k of (4.2) and for all k ^ 0; thus, orthogonality is not, at all, needed 
for deriving it. Specifically, if pi = a + aix, p 2 = bQ + b\x + b2X 2 , and if h k are the 
polynomials in (4.2) and h ky \ are the polynomials given by 

then, with Beale's notation, h k+ i(x) = P k+ i(k + l,x) and h kj \(x) = Pk(k+ 1,jc); see also 
[12], p. 401. Therefore, Beale's identity is equivalent to (cf. [4], eq. (2), p. 207) 

K+i (*) = (*+ 1) [ai + (* + 2)b 2 ]h kA (x) . (5.27) 

On the other hand, the current definition of P k and P k \ can be translated to Beale's notation 
as follows: Since X ~ IP(/i; 5,/3, 7) = IP(/i;g) we have from Proposition 2.1 that /'// = 
P\/pi with p2 = q and p\ = fl —x — q', that is, ao = /i — /3, a\ = — (1 + 25), bo = 7, 
b\ = /3 and b 2 = 8. Furthermore, 

P M W = = /(i) ^ + > 2 +1 W(x)] = (-1)*+ Vt (*) 

and 



-1)* d fe t (-1)* d* 



Mx) dx^ 1VA/lwJ q(x)f(x)dx* 



q\x 



1 



;i-25) 
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Thus, h k+1 = (-l) k+l P k+u h A = {-\) k {l-28) k P KX and (5.27) yields 

(-l) fe+1 ^ +1 = (^+l)[-(l+25) + (^ + 2)5](-l) fe (l-25)^ 1 . 

Thatis,P^ +1 = (^+l)[(l + 25)-(^ + 2)5](l-25) fe P M = (^+l)(l-^)(l-25)^ M , 
which shows that (5.26) holds for all k E {0, 1 , . . .}. 

Applying Lemma 5.2 inductively it is easy to verify the following result. 

Theorem 5.3. If X ~ IP(/i;5,/3,y) with support J(X) = (a, a) and E|X| 2 " < oo for 
some n ^ 1 (i.e. 8 < 2^ry) then 

p k+m( x ) = C { k m \8)P kjm (x), m = 1,2, . . . ,n, k = 0, 1, . . . ,n -m, (5.28) 

where 

C H (5) ;= + _ 2m5) * ^ 2 (1 _ - 5) (5 29) 

j=k+m-\ 

Here, /\ are the polynomials given by (5.1) associated with /, and P^ m are the corre- 
sponding Rodrigues polynomials of (5.1), associated with the density f m [x) = ^ q m^ , 
a < x < co, of the random variable X m ~ JP(fJ, m ;q m ) of Theorem 5.2, i.e., 

^ : = SS^^-Wl = (1 -2i)VW/W g [gt+ " W/WL (»0) 

a < x < to, £ = 0, 1, . . . ,n — m. 

Proof. Apply first Lemma 5.2 to get 

Pk+m = = {k + m){\ - (k + m- 1)5)(1 -28) k+m ~ 

Now, since P^ +m _i i are the Rodrigues polynomials of f\ we can apply again Lemma 5.2 
to X\ with 8\ = jA^. It follows that 

=^+m-2)+l.l = (^ + m-l)(l -(^ + m-2)5 1 )(l -25 1 )^+" I - 2 /\ +m _ 2 , 2 . 
Combining the above equations we see that 

Pk +m = + - + 1)5)(1 -25)^ m -^ +m _ u 

= (k + m)(k + m-l)(l - (fc + m- 1)S)(1 - (k + m-2)8 1 ) 

x (1 -28) k+m -\\ -28^-^^2.2. 

By the same argument it follows that for any j G {0, 1 , . . . , m — 1 }, 

=^ +w _y_i )+ i,i = + " (k+m-j- l)dj)(l -28jf +m -J- l P k+m ^ lJ+1 , 
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where 5/ = 5/(1 — 2 j'5). Thus, we can easily show, using (finite) induction on s, that for 
any s G {1,2, ...,m}, 



p it = { yii( k + m -j)) friV - (k+m-j-i)sj)j mV -25,)^'-^ \ Pk 



+m—s,s- 



Setting s = m it follows that (5.28) is satisfied with 

/ m— 1 \ / m— 1 \ / m— 1 \ 

cf } (5)= f u(k+m-j)\ na-^-j-i)^) f n(i-25/ + ^- i j. 

Now it suffices to observe that HJ~q (k + m-j) = ^ k+ k f* 1 , that 

m— 1 m— 1 / <S \ 

n(i-(*+»»-j-w= n i-(^+m-j-i)-— — 

_ uj- l (i-(k+m+j-i)8) _ ngg^wg 
n7= 1 (i-2j5) nj =0 1 (i-2j5) ' 

and that 

m-l m-l / s \ fc+m-j-l w-l /i ofi-L. k+m-j-l 

_ n'; =0 1 ( 1 - 2 (i+ 1 ) 5 ) A ' + '^ / " 1 _ n7=i(i-2y5)* +m - j " 



npo'd -2 7 -5)*+*-j-i n7 = 7(i -2j5)^+"^- 1 
n , ;r 1 1 (i-2j5)^-.' 



(l-2m5) 



n7 = - 1 1 (i-2j5)^-./- 1 



m— 1 



= (l-2m5) fc n( 1 - 2 i' 5 )- D 

Remark 5.3. (a) An alternative calculation of the constant C\ = cj™^ (5) can be given as 

follows. Lemma 5.2 guarantees that P k + m {x) = QA,m( x ) f° r some constant Q. Arguing 
as in the proof of Lemma 5.2 we see that Q can be derived from the corresponding leading 

P H \ 



coefficients. Indeed, since lead(/v" ! ) = Qlead(P^ m ), we get, in view of (5.2), that 



i A/r>i m )\ (k+m)li , /r) \ (k+m)\ / o\ (k+m) ! 1 - r 2fc+2m-2 / 1 -q\ 

lGad ( P k+m) = L T T^ lead (^+'») = —id^ C k+m(S) = ^IV^ill-jO) 



lead(i\, m ) lead(P fc , m ) n^-iCWSf 

^nSSi 2 i(i-^) _ ^(i-2m5)^n?a?;r- 2 i(i-^) 



k 

j=k+m— 1 
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(b) We note that the recurrence (5.28) is contained in Beale (1937), eq. (4), p. 207, al- 
though it is stated in a quite different notation there. Specifically, if p\ = ciq + a\x, 
P2 = t>o + b\x + bjx 2 , and if are the polynomials in (4.2) and % m are the polynomials 
given by 

then, with Beale's notation, % +w (jc) = Pk+ m (k + m,x) and %, m (x) = Pk(k + m,x). There- 
fore, putting q — > m, k — >• k + m, n — »■ k + m — 1, N' — >• — (1 + 25) and £>" — > 28 in eq. (4) 
of [4], we get 



4+L W = ( II (* + m - i) ( (* + m + / - 1 ) 5 - 1 ) ) %, w (x) 

v 7 (5-31) 

(k + mV I k + 2m - 2 \ 
(_l)»i+± (l- 7 -5) %, m (x), * = 0,1,2,.... 

V=*+m-l / 



On the other hand it is easy to see that Pk +m (x) = (— 1) hk +m (x) and, with p2 = q, 



l) k d k r k (-If d k 



Pk,m( X ) = * / A j t [lm( x )fm(x 



f m {x)dx k ^ mK ,Jmx n p™(x)f(x)dxk 



Pl[X) ;/£(*)/(*) 



[\-2m8y 



(-l)%, M (x) 
(l-2m5) fc 



Thus, = (-l) fc+ " 7 i\ +m , % jW = (-l) fe (l -2md) k P^ m , and (5.31) becomes 
(-l)^it = (-l) m ^±^ f * + ff 2 (1 " -2m8) k P kim , k = 0, 1, . . . ; 

K - \j=k+m-l J 

equivalently, pg> m = ^(1 -2mS)* (n ^ m 2 j (1 - /\,m, which shows that (5.28) 
holds for allfce {0,1,...}. 

(c) Krall [16], [17] characterizes the Pearson system from the fact that the derivatives of 
orthogonal polynomials are orthogonal polynomials. 

We can now adapt the preceding results to the corresponding orthonormal polynomial 
systems. Notice that the following corollary contains the main interest regarding Fourier 
expansions within the Pearson family and, to our knowledge, it is not stated elsewhere in 
the present simple, unified, explicit form. 

Corollary 5.4. LetX ~ IP(/i;<5,/3,y) = IP(/i;g) with support (a, a>), and assume that 
some fixed n ^ 1 (equivalently, 8 < ^ry)- Let {^>k}k=o be the orthonormal 
polynomials associated with X (with lead (0#) > for all k; see (5.6), (5.7)), fix a number 
m E {0, 1, . . . ,n}, and consider the corresponding orthonormal polynomials {(j>k,m}kZo> 
with lead(0^ m ) > 0, associated with 

, / x q m {x)f{x) 
X m ~ f m (x) = _ v ; A , a<x<co. 
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Then there exist constants vj^ = v[ m \n;q) > such that 

n+L( x ) = v k #,mW, a<x<(0, fc = 0,l,...,n-ra. (5.32) 
Specifically, the constants vjf 1 ^ have the explicit form 

( (k+m)\ u k+2m-2 (] _ j8] ) 1/2 

li'^W^ J • (5-33) 

where A m (ii;q) = Eq m (X) is given by (5.17). In particular, setting o 2 = YarX we have 



y/(fc+l)(l-fcg) A / (fc+l)(l-g)(l-fcg) A 

0jt+i W = W = W ^7— j <M X ) ' * = 0,l,...,n-l. 

(5.34) 

Proof. Observe that 

y/E\P k+m (X)\ 2 ^E\P k , m (X m )\ 2 

where Pk+ m an d Pk m are as in Theorem 5.3. Since 

Pk i + ) m (x)=C { k ni \8)P k , in (x), oc<x<cq, 



we conclude that there exists a constant vjf^ such that 0£+m( x ) = v f fe,s>(4 Hence, 
(«) , (*+,»)■ ( * +m)! lead(n+ "' ) 



lead(<fe, m ) lead iead(p,, m ) 

(* + m) ! lead (P k+m ) ^E\P k , n (X m )\ 2 _(k + m) ! c k+m (8) y/^~JX, 



2 

m J I 



*! lead (P k , m ) ^E\P k+m (X)\ 2 k\ c k (8 m ) ^E\P k+m {X)\ 2 

where, by (5.2), c k+m (8) = UfJ^M 1 ~J 5 ) and 

Ok— 2 2k— 2 s 

c k (s m )= n a-w= n (Wiz^) 

_ ngr, 2 i(i - (2m+ _ ngg^wg 

(l-2mS) fe (l-2m5) fc 
From (5.5) we see that E\P k+m (X)\ 2 = (k + m)\c k+m (8)Eq k+m (X) and 

,2 tl „ \ , fi ,E9*(Z)9»(Z) fc!c,(5 m )E^+ m (X) 



E|^,„ ; (^m)r = *!cjfc($n)E^&») = *M V 



E? W (Z) (1 -2m8) k Eq m (X) ' 
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Combining the preceding relations we obtain 



v 



(m) 



(k + m)lc k+m {8)y/E\P) c , m (X m )\i 
kl c k (d m )^/E\P k+m (X)\ 2 



[k + m)\ c k+m { d ) y {l _ 2m5) k mqm{x) 
kl c k (5 m )^/(k + m)lc k+m (5)Eq k + m (X) 



(k + m) I c k+m (5) ^klc k {8 m )Eq k + m {X) 
kl c k (8 m )^(k + m)lc k+ US)Eq k +'-(X)(l-2md) k Eq' n (X) 

y/(k + m)lc k+m (5) I (k + m)l 



C k+m (S) 



^klc k {8 m ){ \ -ImdfEq^X) V kl ^ m ( X ) V c k (8 m )(l-2m8y 



(k + m)l 



n2k+2m—2 
j=k+m 



: 2 i(i-j5) 



«Eg»(Z) * n^,(Wg) (1 2mg) , ^ «Eg»(Z) 



(* + m)! 



and the proof is complete. 



□ 
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